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HUMAN ESCAPE LEARNING IN RELATION TO 
REINFORCEMENT VARIABLES AND 
INTERTRIAL CONDITIONS ! 


JAMES H. STRAUGHAN? 


Indiana University 


The present experiment was de- 
signed to study human escape learning 
in a simple instrumental situation. 
In an escape learning procedure, S 
was required on each trial to choose 
one of two responses, with correction 
allowed, in order to turn off an 
auditory stimulus. <A feature of such 
a design is that it meets the require- 
ments for analysis in terms of the 
statistical learning theory that has 
been applied to verbal conditioning 
experiments (3, 5); therefore we can 
test the extension of the theory to 
another situation. Finally, we may 
study the effects of stimulus intensity, 
a variable which may be important in 
escape learning although it is not 
taken into account in the statistical 
theory, and also certain questions 
concerning intertrial conditions raised 
by the verbal conditioning experi- 
ments. 


This study was taken from a dissertation 
submitted to the Psychology Department of 
Indiana University in partial fulfillment of the 
requirements for the Ph.D. degree in psychology. 
The writer is greatly indebted to Professor W. K. 
Estes under whose direction the investigation 
was conducted. 

2 Now at Montana State University. 


The statistical theory, as applied to 
verbal conditioning, is intended for 
situations in which (a) the organism 
undergoes repeated trials with the 
same situation, as experimentally 
defined, being presented on every 
trial; (6) the responses available to 
the organism are categorized by 
experimental criteria as members of 
mutually exclusive, exhaustive re- 
sponse classes; and (c) reinforcement 
is such that each trial has a fixed 
probability of terminating with a 
member of a given response class. 
The data should be expected to 
correspond to the model in two major 
features. First, the probability of 
responding should change in accord- 
ance with a negatively accelerated 
curve running from the initial proba- 
bility to the final asymptote. Second, 
the asymptote which is approached 
should be equal to the probability 
that the response will be reinforced 
when it occurs. 
have 


These characteristics 
appeared in animal T-maze 
studies (4) and verbal conditioning 
experiments (3, 5). A third feature 
has been found in the verbal con- 
ditioning experiments to which the 





2 JAMES H. STRAUGHAN 


theory has been applied: the rate of 
approach to asymptote appears to 
depend upon the difference between 
initial response probability and asymp- 
totic probability, with the faster 
rates being associated with the larger 
differences. On the hypothesis, pro- 
posed elsewhere (3), that this third 
feature depends upon variables as- 
sociated with massed trials, we should 
expect that the degree to which it is 
manifested in the present experiment 
will be a function of the isolation of 
trials from one another. Although it 
may not be possible to achieve an 
experimental situation in which trials 
are strictly independent, some control 
may be exerted over the degree to 
which trials are isolated from one 
another by requiring Ss to perform a 
distracting task between trials. 


MetTuop 


Design.—The experiment used escape learning 
with a tone as the reinforcing stimulus. On 
each trial, only one of the two possible responses 
would terminate the tone. The dependent 
variable was the relative frequency with which 
one of the response classes, A,, occurred first on 
each trial. ‘Trials on which the A, response was 
correct are termed FE, trials; others are termed 
E, trials. All Ss received 160 trials, divided into 
two series, A and B, of 80 trials each. Three 
independent variables with two values of each 
variable made an eight-group factorial design 


TABLE 1 


Experimenta Desicn 





Probability 
of Rein- 
forcement 

(wr) Series A 


Stimulus 
Intensity 


Distracting 
Task 








Absent 
Present 
Absent 
Present 
Absent 
Present 
Absent 
Present 














es 


Note,——Probability of reinforcement during Series 
B was .30 for all groups. 


appropriate, as summarized in Table 1. The 
independent variables under investigation were 
(a) the probability that an A, response would be 
followed by termination of the tone on any trial 
(70% or 100% in Series A, 30% in Series B), 
(b) intertrial conditions (distraction present or 
absent), and (c) the intensity of the tone (80 or 
125 db). 

In order to control for possible effects due to 
differences in sequences of E, and EF, trials, all 
Ss within a group were given different rein- 
forcement sequences, although the same 16 
random sequences were used for all groups. In 
randomizing sequences, the probability of an 
E; trial was fixed, although the proportion of FE, 
trials in a series was not. ‘The particular re- 
sponse to be designated A, was chosen at random 
for the first S in each group and alternated for 
subsequent Ss. 

Apparatus.—A_ sound-insulated room was 
used with a small fan to provide a masking noise. 
Illumination was from a 40-w. centrally located 
bulb. To S$ the apparatus appeared as a gray 
panel, 35 in. wide by 24 in. high. The auditory 
stimulus, an impure tone of 500 cycles, was 
produced by an Army surplus signal generator 
and presented to S by an Army surplus headset. 
Neither of these pieces of equipment had a 
manufacturer's identification. The impurities 
of the tone were not analyzed, although they 
seemed to be largely harmonics. The soft tone 
and the loud tone were respectively 80 db and 
125 db (re .00025 dynes/cm*). On the front of 
the apparatus and 16} in. above the table top 
was a black knob 2] in. in diameter with which 
S could turn off the tone when it cameon. This 
knob was kept centered by a small spring and 
friction device and required a 70° turn either to 
the left or to the right in order to turn off the 
tone. The force required for turning the knob 
was 20 lb. applied tangentially to the circum- 
ference of the knob. 

Directly below the knob on the apparatus 
panel was a horizontal slot through which S$ 
could see a small black spot when a 7-w. pilot 
light was on. Also in the slot but in front of 
the black spot was a small vertical pointer which 
S could move from side to side by means of two 
handles. When the black spot moved, S could 
keep the pointer in front of it by moving the two 
handles back and forth. This tracking task is 
the distracting task mentioned elsewhere in this 
paper. The slot was 10} in. above the table top, 
vx in. high, and 44 in. wide. ‘The moving black 
spot was actually a yy-in. wide, irregular, wave- 
like black line drawn freehand on a roll of white 
paper. This paper was moved behind the slot 
by an Esterline-Angus chart drive so that the 
line appeared to S as a moving black spot. 
Between the paper (about yy in. behind the slot) 
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and the slot was the vertical pointer, made of 
Brown and Sharpe No. 30 piano wire, which was 
fastened directly to a bar joining the two con- 
trolling handles. These handles, of § in. clear 
plastic rod, were 194 in. apart, 84 in. above the 
table top, and extended | in. through slots in 
the front of the apparatus. They were so 
placed that S could rest his arms on the table 
while controlling them. Behind the panel and 
out of sight of S during the experiment were 
devices with which E controlled the apparatus. 


100 
GROUP | 


60 


60 pe eee 
Pfr’) 700 
40 -(13e.709" 


20 P, n’)= 300 


° 


ie 


“<23exese ' 


Subjects.—The 128 Ss were obtained from 
various spring and summer session classes on the 
Indiana University campus. Volunteers for the 
most part, they were right-handed males and 
females of normal college age and over. They 
were assigned randomly to the experimental 
groups. 

Procedure.—Each S given the distracting 
task was told that the study concerned the effects 
of a periodic interruption on tracking. He was 
told to keep both hands on the handles con- 
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10 12 14 
BLOCKS of 


Fic. 1. 


Observed and theoretical proportions of responses per block of 10 trials (n’), 


6o 2 4 6 
TEN TRIALS (n') 


The 


theoretical curve is in terms of pio(n’), the average probability of an A, response in the n’th block 


of 10 trials. 
the other groups and series. 


A theoretical curve for Group II, Series A, was not obtainable by the methods used for 
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trolling the pointer except when he heard the 
sound in the earphones. When he heard the 
sound, he was to turn it off with his right hand. 
The instructions for the groups which did not 
have the distracting task were similar. They 
were to keep their hands at rest on the handles 
during the experiment except when they were 
turning off the sound. 

The trials began approximately 30 sec. after 
E and S were prepared. ‘Thereafter, the sound 
came on for each S every 15 sec. for 160 trials 
without pause. To keep Ss from long delays in 
turning off the sound, the following procedures 
were followed. If S did not turn off the sound 
on the first trial within 15 sec., he was asked to 
turn off the sound by turning the knob. In 
some cases Ss failed to make the corrective 
response on the first trial that correction was 
necessary. ‘These Ss were told at the end of the 
15-sec. interval to turn the knob the other way 
if the first way did not work. In no case was an 
S given either of these additional instructions 
more than once. When S took longer than 15 
sec, to turn off the sound, the next trial did not 
begin until the end of the 15-sec. interval in 
which he turned it off. 


Resu_ts AND Discussion 


Terminal response probabilities.- 
The data for each group in terms of 
the proportion of A; responses per 
block of 10 trials are plotted in Fig. 1. 
There are two predictions concerning 
observed final levels and theoretical 
asymptotes to be tested: (a) the 
observed should not differ signifi- 
cantly from the theoretical values, 
and (6) observed values which cor- 
respond to the same theoretical asym- 
ptote should not differ significantly 
from each other. In order to make 
these comparisons, the proportion of 
A, responses for each group in the last 
20 trials of each series (A and B) was 
taken as the observed asymptotic 
value; these values are presented in 
Table 2. 

Before making tests of the corre- 
spondence between observed and theo- 
retical values, a statistical check is 
called for in order to determine 
whether or not the groups have 
arrived at stable asymptotes of re- 


TABLE 2 
Mean anv VARIANCE OF Proportions or A; 


Responses in THe Last 20 Triats 
or Eacu Series 








Series A Series B 





Group ; 
Vari- Mean Vari- 


ance 


Mean 


%-30% | 
(LA)* | .688| .038 332 
(LP) .788| 032 | 581 | . 
(SA) | 684) 040 | 294 | , 
(SP) 697 | 054 | 509 | 


100% -30% 
Vv (LA) 
VI (LP) 


| 997 
Vil (SA) 

| 

! 


425 | 


“1.000 | 322 | 


1,000 | . 328 | 
1,000 | 000 | .372 


Vill (SP) 





* Land S stand, respectively, for the loud (125 db) 
and soft (80 db) tone; A and P for the absence and 
presence of the distracting task. 


sponding. A convenient method is 
to examine the difference between the 
numbers of A; responses in the last 
two blocks of 10 trials for each series 
and group. Analyses of variance 
were made separately for (a) the 
70%-30% groups, Series A, and (6) 
all groups, Series B. None of the 
F’s were significant and the mean 
differences were not significantly dif- 
ferent from zero. 

The analysis of variance of the 
observed asymptotes for the 70% 
30% groups in Series A gave no sig- 
nificant F’s. The within-cells vari- 
ance from this analysis was used in 
testing the difference between each 
of the group values and the corre- 
sponding predicted value. None of 
these t’s approached significance. For 
the 70%-30% groups in Series B the 
distracting task is significant at the 
OO1L level (F = 16.92); the groups 
that had the distracting task showed 
more responses during the last 20 
trials than the other groups. The 
t’s, computed as above, show that the 
proportions of responses in the last 
20 trials of Groups II (t = 4.98, 
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P= 001) and IV’ (t = 3.70, P 
= 01), the groups with the dis- 
tracting task, are significantly above 
the predicted asymptotic values. 
The analysis of variance of the 
observed asymptotes for the 100% 
30% groups in Series B showed none 
of the F’s to be significant, although 
computation of the ?’s as above 
showed that the proportion of re- 
sponses in the last 20 trials of Group 
VI is significantly above the predicted 
asymptotic value (tf = 2.70, P = .02). 
No tests were made of the asymptotes 
of the 100%-30% groups, Series A, 
since it is obvious that they conform 
closely to the predicted values. 

We may conclude that the observed 
asymptotes behave as expected in 
Series A but that there are some 
deviations in Series B. These devi- 
ations occur predominantly in those 
groups receiving the distracting task 
and are all the theoretical 
asymptote. The interpretation may 
be either that the frequency curves 
are approaching a higher asymptote 
or that they are approaching the 
predicted asymptote but at a slower 
rate. In favor of the latter inter- 
pretation, we note that those groups 
that show a significant deviation 
from the predicted asymptote in 
Series B are still decreasing in the last 
two blocks of trials. 

Learning rates.—Differences in re- 
sponse totals may be taken as evi- 
dence for differences in learning rates 
when both initial response levels and 
asymptotes can be assumed constant 
in the given comparison. Since much 
of the learning seems to have occurred 
early in the series, the comparison of 
learning rates was made by using 
mean numbers of responses in the 
first 20 trials rather than means taken 
over the total series. The mean 
number and the variance of A, 
responses made by each group in the 


abe ve 


first 20 trials of each series are shown 
in Table 3. Analyses of variance for 
the 70% and 100% groups were done 
separately for Series A and again for 
Series B. For the 100% groups in 
Series A the data were transformed 
by the formula, y = e®, 
Bartlett’s test indicated that the 
Variances were not homogeneous. 
The significant results of these analy- 
ses were as follows: (a) the inter- 
action between sound intensity and 
distracting task is significant for the 
70%-30% groups in Series A (F 
= 6.09, P = .05); (b) the effect of 
the distracting task is significant for 
the 70%-30% groups in Series B 
(F = 9.01, P= 01), and for the 
100%-30% groups in Series 4 (F 
= 11.1, P= 001) and Series B 
(F = 4.01, P = .05). 


because 


Thus, we conclude that the 
distracting task increased the number of 
A, responses during the first 20 trials for 
all groups in Series B, and decreased the 
number of A; responses during the first 
20 trials for the 100%-30% groups in 
Series A. Both of these effects were in 
the direction of slower learning for the 
groups with the distracting task. The 


may 


TABLE 3 


Mean Numper ann Variance or A, 
Responses in THE First 20 Triars 
or Eacu Series 


Series B 


Group | } 
Vari 


ance 


70%-30% 
I (LA)* 
Il (LP) 
Ill (SA) 
IV (SP) 


100% 30% 
V (LA) 
(LP) 
(SA) 

I (SP) 


11.50 
14.00 
12.93 
1.19 


9.50 
14.24 
10.32 
12.00 


18.13 
15.97 
1K.RG 


21.07 


19.13 
17.81 
1k.43 
16.81 


10.73 
14.67 
10.67 


9.33 


VI 
Vil 
Vil 


* See Table 2 for meaning of letter evmbol« 
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trend for the 70%-30% groups, Series 
A, was not consistent. The interpre- 
tation of the significant interaction 
between effects of the distracting task 
and sound intensity in the 70%-30% 
groups, Series A, is made difficult by the 
possible differences in starting points, 
i.c., differences in f(0). There is no 
evidence that the independent variable 
of sound intensity had any effect upon 
the learning rate aside from this inter- 
action effect. 

Parameter estimates.—An analysis of 
the results further in terms of statistical 
learning theory requires reference to 
aspects of the model that are applicable 
to the present experiment. From as- 
sumptions discussed elsewhere (1, 2, 3) 
the following equation for response 
probability as a function of trials can be 
derived: 


p(n) = 9 — (x — p(0)J1— 6)", (1) 


where f(n) is the mean probability of an 
A; response on trial » + 1 and @ is a 
parameter describing learning rate, or, 
in terms of statistical learning theory, 0 
is the proportion of stimulus elements 
sampled on each trial. 

In order to account for the possibility 
that successive samples of stimulus 
elements are not independent, we may 
introduce two parameters, u and 4 to 
replace 9, where uw is the average proba- 
bility that an element is in the sample on 
a trial and 4 is the probability with which 
an element remains in the sample from 
the preceding trial. We should expect 
that insertion of a distracting task 
between trials should decrease the size 
of u, the size of 4, and the difference 
between 4 and u, with 4 being larger 
than uw. It can be shown (6) that, with 
the additional parameter, Equation 1 
takes the form 


p(n) = x — [x — p(0)) 
KM (1 — d)(1 — we". (2) 


Equation 2 can be put into the more 
convenient form 


Piw(n’) = « — [e — pwl(l)] 
> 4 (1 — 4)ien’=1) (3) 


TABLE 4 


Estimates or u AND b ror Eacu Series 





Group 
Series | Series Series 
KB A B 





O41 

O12 | 
026 | 
OOS | 


(SP) 
100% -30% 
Vv (LA 





A) | 020 
VI (LP) 076 | 008 | 
VII (SA) 183 | .035 | 
VIII (SP) | O12 | 


* See Table 2 for —z of letter symbols 
t Value of u for Group II, Series A, not obtainable 
from data. 


where pio(n’) is the average probability 
of an A; response in the n’th block of 10 
trials. 

The values of u obtained by applying 
Equation 3 are given in Table 4. For 
algebraic reasons no solution is obtain- 
able for Group II, Series A; therefore no 
theoretical curve is fitted to this data in 
Fig. 2 Also, values of u are not ob- 
tainable for many individual Ss; we 
cannot, therefore, use analyses of vari- 
ance based on individual u’s to test 
group differences. An alternative 
method of testing the significance of 
effects of experimental variables upon 
u values is to compare groups with 
respect to learning rates for each of the 
series separately. This has been done 
in the preceding section and the results 
confirm predictions concerning the size 
of u. 

The values of 6 obtained for each group 
by finding the difference between the 
conditional probabilities (3) are shown 
in Table 4. Because E, trials did not 
occur during Series A for the 100%-30% 
groups, values of 4 for these groups are 
not obtainable from the data. Our 
expectation that 4 should be smaller for 
the groups that had the distracting task 
is confirmed by Table 4. Analyses of 
variance were computed for the 70%- 
30% groups, Series A and B, as shown 
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TABLE 5 


Anatyses or VARIANCE OF b ann (b-u 
Series 


Vatues ror tHe 70%-30% Groups, 
‘ano B 


Source 


Between groups 
Sound intensity 
Distracting task 
$.I. xX D.T. 
Between Ss, within groups 
Series 
Series X Groups 
Series X S.I. 
Series X D.T. 
Series X D.T. X S.L. 
Ss X Series, within groups 


* Significant at .05 level. 
** Significant at .01 level. 


in Table 5, and for all groups, Series B, 
as shown in Table 6. The effect of the 
distracting task is significant at the .01 
level for the 70%-30% groups in Series 
A and B (F = 8.29) and for all groups 
in Series B (F = 10.00). The groups 
that had the distracting task showed 
smaller values of 6. In the analyses of 
Table 5, the between-Ss within-groups 
variance estimate was used as the error 
term in computing the between-groups 
F’s, and the Ss-by-series within-groups 
variance estimate was used as the error 
term for the series and series-by-groups 


| Mean 


Square 


Mean 


Square 


2451 
0241 
DS2 
1059 
0730 
0100 
0305 
0578 
0336 
0000 
0262 


2482 +.31* 
O16 154 

5468 7.271°* 
1863 2.477 

0752 
0000 
O438 


—— oe 


oO 
— 


000 
3.532° 
1203 9.702°* 
0063 508 
0049 395 
O14 


——— eo 


~ 
~— 


F's, The size of 6 appears to be inde- 
pendent of sound intensity and series; 
however, there is a significant difference 
between the 70%-30% groups and the 
100% -30% groups in Series B. 

Another indication of the degree of 
independence of stimulus samples drawn 
on successive trials should be the size of 
differences between 6 and uw. If 4 and 


u are equal, then successive samples are 


presumably statistically 
and the mathematical model we are 
using reduces to the simpler model 
where @ is equal to 46 and uw. If, then, 


independent, 


TABLE 6 


Anatyses or Variance or b ann (b-u) VaLues ror att Grours, Seaies B 


Source 


Within groups 
Between groups 
Sound intensity 
Distracting task 
Reinforcing conditions 
am aik 

S.1. X R.C. 

iy ee @ Fo 

5. X Dt. X BL. 





Mean 


Square Square 


O58} 
1827 
0123 
5827 
AIDS 
O517 
0805 
0397 
0123 


OS84 

1647 2.437° 
Ols4 229 
A122 7.058** 
5432 9. s1"* 
O4RX KY, 
0922 1.579 
0078 134 
0421 721 





* Significant at .05 level. 
** Significant at .01 level, 
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(6-u) is smaller for those groups having 
the distracting task, we may accept this 
as evidence supporting the mathematical 
theory. In order to examine the effect 
of the distracting task upon the size of 
(4-u), individual values of (d-u) were 
obtained by subtracting the value of u 
obtained for a group from the value of 
4 obtained for each individual § in the 
group. Analyses are shown in Tables 
5 and 6. The error terms used in the 
analysis of (4-u) in Table 5 are the same 
as those used for 6. The effect of the 
distracting task is significant at the .01 
level for the 70%-30% groups in Series 
A and B (F = 7.27) and for all groups 
in Series B (F = 7.06); the groups with 
the distracting task showed smaller 
values for (6-u) than the groups without 
the distracting task. Not predicted in 
the model are the significant interaction 
between series and sound intensity in 
the 70% -30% groups, and the significant 
difference between the 100%-30% 
groups and the 70%-30% groups in 
Series B. 
SuMMARY 


An experiment was conducted to study two- 
choice escape learning in a simple instrumental 
situation with human Ss. The experimental 
design grew out of an analysis of the situation in 
terms of the statistical learning theory that has 
been applied to verbal conditioning experiments 
(3, 5). 

Eight groups of 16 Ss each were used in a 
three-variable factorial design. Two mutually 
exclusive responses were available to S on each 
trial, turning a knob clockwise and turning it 


counterclockwise; reinforcement was cessation 
of a tone following choice of the correct response. 
All Ss received 1 uninterrupted trials spaced 
at 15-sec. intervals. The independent variables 
were probability of reinforcement, presence or 
absence of a distracting task between trials, and 
intensity of the tone. It was found that termi- 
nal response probability tended in most cases to 
match probability of reinforcement; learning was 
generally slower with the distracting task; and 
stimulus intensity had little effect. Predictions 
made on the basis of statistical learning theory 
appear to be largely verified. 
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In a previous study (2) performance 
on a second task was studied as a 
function of whether the S-R pairs to 
be relearned were the same or different 
from those learned in the first task 
and of instructions emphasizing speed 
of responding. A significant inter- 
action found based on poorer 
performance under the speed con- 
dition on those S-R pairs which were 
different and a tendency for this 
condition to facilitate performance on 
chose pairs which were the same. 

Assuming that the correct response 
tendency is dominant where the S-R 
pairs are the same while the incorrect 
tendency is dominant in the case 
where they are different, these results 
were interpreted to be in accord with 
Spence’s (8) derivations from Hull 
(4) regarding performance in complex 
learning situations as a function of 
variations in the level of drive. On 
the basis of these derivations, and 
assuming that the speed instructions 
served to increase the drive level, it 
would be expected that performance 
should be deleteriously affected by an 
increase in the drive level where the 
incorrect tendency is 
However, facilitation 


was 


dominant. 
would be ex- 


pected where the correct response 
tendency is dominant. 

The present study provides further 
data relevant to these expectations. 


‘Appreciation is expressed to Robert 
Everetts, City Superintendent of Schools, Helen 
Levis, ‘Principal, Lincoln Elementary School, 
and Elsie Glassburner, Principal, Stewart 
Elementary School, Washington, Iowa for their 
assistance and cooperation in providing experi- 
mental facilities and arranging for subjects. 

* These data were collected by Lewis Lippsit. 


Dominance of either the correct or 
incorrect response tendencies is as- 
sumed to be varied in the learning of 
a second task by a modification of a 
procedure employed by Duncan (3) 
and Porter and Duncan (7). That 
is, the second task requires learning to 
re-pair half of the S-R pairs which 
have been learned in the first task 
while the other half is left unchanged. 
The method for varying the level of 
drive differs from the previous study 
in that a modified task pacing tech- 
nique is used. It has been assumed 
by others (5, 6) that stress is a result 
of excessive task pacing. In the 
present study it is assumed that this 
method of inducing stress results in 
variations in the level of drive. 


Metnop 
Subjects.—A criterion of less than 15 errors 
on the first task was established, on the basis of 
preliminary work, as a basis for selecting Ss in 
the present study. Sixty-two fourth- and fifth- 
grade Ss from a total of 78 met this criterion. 
These Ss were randomly assigned in equal 
numbers to the paced (stress) and nonpaced 
(nonstress) conditions. 

Apparatus and procedure.—The apparatus 
consisted of a box 13 K 124 & 54 in., painted 
flat black. Projecting from its base was a 
sloping response panel approximately 74 K 9} 
X 13 in. Two push buttons were arranged on 
this panel on a horizental plane spaced 6 in. 
apart and 2 in. from S’s edge of the panel and 
toward its center. On the panel and so that it 
formed the apex of a triangle with the two push 
buttons serving as the points connecting the 
base was a green, jeweled reflector illuminated 
by a 6.3-v. pilotlamp. Centered 4 in. above the 
response panel was a | in. diameter aperture of 
flashed opal glass. Housed behind the aperture 
were four 6.3-v. lamps three of which had been 
colored red, blue, or amber. ‘The construction 
of the apparatus was such that by means of a 
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TABLE 1 
Errors To Cuancen ann UncHANGED 
Pains on Seconn Task 


S-R Pair 





Group Changed Unchanged 





| 3.00 
3.17 


9.58 4.35 


7.32 


Stress 
Nonstress 4.15 


} 


5.77 


rotary selector switch any single light could be 
activated and by means of a toggle switch either 
one of the two buttons could be set to turn it off. 
If the correct button was selected, the light went 
off and the green light on the response panel was 
activated. Incorrect responses affected neither 
the stimulus light nor the green light on the 
response panel. During the second task the 
amount of time the stimulus light was on was 
controlled by means of an electronic interval 
timer. The arrangement, four lights and two 
buttons, permitted construction of a task com- 
posed of four S-R pairs. Each button was 
learned to two of the lights separately. The 
selection of the two pairs to be re-paired in the 
second task was determined randomly with the 
exception that it result in each of the two 
buttons being associated with two of the lights. 

With the restriction that each light appear 
once in every four presentations, the order of 
presentation was randomly determined for both 
tasks. All Ss were given 8 presentations of each 
light in the first task for a total of 32, and 10 of 
each light in the second task for a total of 40 
presentations. In the first task a light remained 
activated until S made the correct response. 
However, in the second task only one response 
per light was permitted. No attempt was made 
to control the interval between stimulus pre- 
sentations in both tasks. 

The instructions for the first task were the 
same for all Ss. They were designed to indicate 
to § that the task required learning which of the 
two buttons turned off a given colored light. 
Upon completion of the first task a 2-min. rest 
period was introduced. Other than to require 
S to remain seated no other attempt was made 
to control his activity during this period. At 
the end of the rest period all Ss were told that 
they would not be permitted to correct for 
errors. No information was given regarding 
the fact that half of the S-R pairs would require 
re-pairing. In addition to these instructions Ss 
assigned to the stress condition were given a 
demonstration in which two lights were acti- 


vated for | sec. each and informed that failure to 
respond within this interval would be scored as 
an error. However, during the task the timer 
was set to give a 2-sec. exposure of the light and 
if a response was not made during this interval, 
the same light was re-presented until a response 
could be recorded for that trial (failures to 
respond do not constitute a part of the data in 
this study). In no case, however, did a given 
light have to be re-presented more than once. 
Only four Ss in the stress condition failed to 
respond within the 2-sec. interval on the first 
presentation of a light. For these Ss a light 
selected for re-pairing had to be re-presented a 
total of three times and a light which was not to 
be re-paired for a total of three times also. 


Resutts anp Discussion 


Error scores of the two groups for 
the 32 trials in the first task were 
subjected to a ¢ test to test the hy- 
pothesis that they were random 
samples from the same population. 
The resulting ¢ of .23 with 60 df 
permitted retention of the null hy- 
pothesis. 

Table 1 presents the number of 
errors made by each group on each 
type of S-R pair separately in the 
second task. On the basis of these 


o———-e  S'RESS - CHANGED 
o———-—-® NON-STRESS — CHANGED 

° © WNON- STRESS — UNCHANGED 
o———* STRESS ~ UNCHANGED 


a: ek cc a ee me ee ee a ee ee 








TRIALS IN BLOCKS OF FOUR 


Fic. 1. Performance curves on each type of 
S-R pair separately for stress and nonstress. 
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data it is apparent that the stress 
group performed better than the 
nonstress group on the unchanged 
pairs but more poorly on the changed 
pairs. 

Figure 1 presents these data in 
blocks of four presentations for each 
type of pair separately. With the 
exception of the fourth block, it can 
be seen that the performance of the 
stress group is consistently superior 
to that of the nonstress group on the 
unchanged pairs and inferior, without 
exception, on the changed pairs. 

Table 2 presents a summary of an 
analysis of variance based on the data 
presented in Fig. 1. This analysis 
indicates that the main effects of 
stress are not significant. The effects 
of re-pairing as well as those due to 
blocks are highly significant. The 
significant interaction between Blocks 
and Pair, while not of particular 
interest in the present study, is 
apparently due to the overlapping 
between the performance curves of 
the changed and unchanged pairs of 
the nonstress group. It is the sig- 
nificant between Stress 
and Pair, however, that is of central 
importance. 


interaction 


Viewed in conjunction 
with the data presented in Fig. 1, this 


TABLE 2 


Anatysis or VARIANCE oF Frror 
Scores on Seconn Task 


Source 


Stress 

Error (b) 
Blocks 

Pair 

Blocks & Pair 
Blocks X Stress 
Pair X Stress 
Blocks * Pair X Stress 
Error; (w) 
Error, (w) 
Error, (w) 


*P < 025 
-P < wl. 


interaction may be interpreted as 
indicating that the effects of stress are 
contingent on the type of S-R pair 
involved. That is to say, stress 
results in fewer errors, in comparison 
to nonstress, on the unchanged pairs 
and in more errors on the changed 
pairs. Since the triple interaction 
did not indicate that the magnitude 


of such effects varied with blocks, 


tests of the simple effects of stress are 
based on the means shown in Table 1. 
These indicated that the difference in 
errors on the unchanged pairs was 
significant at only the .20 level and at 
the .10 level on the changed pairs. 


One of the properties commonly 
ascribed to drive states (1) is that they 
serve to combine with and enhance the 
strength of those response tendencies 
aroused by the stimulus situation. 
Moreover, Spence (8) has outlined the 
theoretical considerations which neces 
sitate control, in those situations where 
more than response tendency is 
aroused, of whether the dominant tend 
ency at the time drive is increased is 
correct or incorrect for the task. Other 
things being constant, and provided this 
factor can be specified, cither improve 
ment or impairment of performance may 
be predicted as a function of stress on 
the assumption that it serves to increase 
the level of drive. In the present situ 
ation, then, it would be expected that 
stress should facilitate performance on 
the unchanged pairs and impair it on 
the changed pairs. The results did tend 
to confirm these expectations. 

A variety of methods have been used 
for varying stress in complex learning 
situations and a frequent finding is for 
performance to be impaired under stress 
(5). It is quite reasonable to presume 
that the different methods employed 
contribute their own unique set of stimuli 
to the total stimulus complex. As 
sociated with such _ stress-produced 
stimuli are previously learned or un- 
learned responses which may 
tagonistic and/or 


one 


be an 
incompatible with 
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those required for correct performance. 
Hence, quite aside from its assumed 
intensifying property, should such in- 
compatible tendencies become dominant 
in the situation, it would seem reasonable 
to expect general impairment of per- 
formance under stress. Since the pres- 
ent results indicated that stress tended 
to facilitate performance on the un- 
changed pairs, it may be assumed that 
this factor was controlled in the present 
study. 


SUMMARY 


The present study was concerned with per- 
formance in a second task as a function of task 
pacing and whether the S-R pairs to be relearned 
were identical to those learned in the first task 
or were required to be re-paired. 

All 62 Ss were first given 32 trials on the first 
task. Half of the Ss relearned the second task 
under conditions of excessive task pacing. ‘The 
second task required re-pairing of half of the 
S-R pairs learned in the first task while the other 
half remained unchanged. A significant inter- 
action between these two variables was found. 
While the simple effects were not statistically 
significant they did tend to indicate that the 
stress condition resulted in fewer errors on the 
unchanged pairs in comparison to nonstress and 
in comparatively more errors on the changed 
pairs. 
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The effects of stress on performance 
is a complex and important problem 
for psychological analysis. Many 
studies indicate that experimentally 
induced stressful conditions produce 
decrements in performance (1, 3, 4, 
11). The results of other studies 
indicate that no decrement is pro- 
duced under stress (7, 17). Still 
others have found both decrements 
and increments in performance as a 
result of stress (7, 13, 17). Two 
main sources have been suggested as 
important factors affecting the re- 
lationship: (a) the method of ex- 
perimentally inducing stress and (b) 
the task to be performed. Closely 
associated with the first source is the 
problem of motivation. 

This study has three major ob- 
jectives: (a) to compare two dif- 
ferent methods of inducing stress, 
(b) to explore the relationship be- 
tween anxiety and performance under 
stressful conditions, and (c) to study 
stress and anxiety effects on the 
performance of a task of relative 
generality. 

Two methods of inducing stress 
were selected with regard to the 
classification reported by Lazarus, 
Deese, and Osler (12). The first 
method, failure stress, was induced by 
reporting false failure scores to the Ss. 


1 This experiment is one of a series of studies 
on behavior decrement performed under Con- 
tract No. DA-49-007-MD-222 between the 
University of Maryland and the Medical 
Research and Development Board, Office of 
the Surgeon General, Department of the Army. 

? This report has been prepared by T. G. A. 
and §. R. alone due to the untimely death of 
Mr. Davidson in March, 1954. 


The second method, task-induced 
stress, was produced by the rapid 
presentation of stimuli for discrimi- 
nation. The need for comparative 
studies of the different methods for 
experimentally inducing stress has 
been pointed out by Lazarus, Deese, 
and Osler (12) and by Holtzman and 
Bitterman (6). 

In recent years an increasing in- 
terest has been developing in the 
attempt to account for some of the 
differential results of performance 
under stress by seeking relationships 
between manifest anxiety and _ per- 
formance (10, 13). The present ex- 
periment also deals with this char- 
acteristic. 

Metnop 


Design.—A basic 3 X 3 X 2 factorial design 
was used with three levels of failure stress, three 
levels of task-induced stress, and two levels of 
anxiety, with three Ss per cell. This design was 
used in a manner developed by Johnson and 
Tsao (8) to the factor of 
differences. 

Task.—The performance task selected for the 
experiment was continuous, high-speed color 
naming, which was considered as representative 
of a wide variety of performances involving 
continuous 
ous 70-unit 


assess individual 


A continu 
series of five different colors in 
random order on a Kodachrome film strip was 
projected in S's view. 


visual discrimination. 


The colors were red, 
green, pink, yellow, and blue. The S’s task was 
to name each color as it appeared, 

Variables.—Task-induced stress was gen 
erated by varying the speed of presentation of 
the colors. The three speeds of presentation, 
automatically controlled, were 75, 85, and 100 
stimuli per minute. At 
stimulus was exposed for 
respectively. 


these speeds, each 
4, ae and 6 sec., 


Failure stress was induced by verbal instruc- 
tions and electric shock as an indicator of failure. 
Before the testing S was informed that per 
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formance on the task was related to mechanical 
aptitude, clerical aptitude, and intelligence test 
scores. He was also told that the apparatus was 
set to deliver automatically an electric shock for 
poor performance at the end of each test period. 
Each S was then given the same moderat* shock 
as a demonstration of what to expect for failure. 
What constituted “failure,” punishable by 
shock, was different for each of the three groups. 
The “slight failure” group was told that only 
if their performance fell slightly below the per- 
formance level of the average high school senior 
would they receive the electric shock. The 
“moderate failure” group was told that only if 
their performance fell considerably below the 
level mentioned would they receive the shock. 
The “high failure” group was told that only if 
their performance fell extremely far below the 
performance level of the average high school 
senior would they receive the shock. 

Each S was told that after 30 sec. on the task 
he would receive an auditory signal which would 
inform him of the efficiency of his performance 
up to that point. He was told that if he heard a 
bell he was doing well enough to avoid the shock. 
If he heard a buzzer, however, he was in danger 
of receiving the shock, unless his performance 
improved throughout the remainder of the trial. 

After 30 sec. on the task, the buzzer was 
sounded for all Ss, and at the end of the per- 
formance period all Ss received the electric 
shock. After the 3-min. trial a rest period of 
30 sec. was allowed in order to permit the 
distractive effects of the shock to dissipate. 
During this period, E made derogatory com- 
ments about S’s performance. The comments 
were more derisive the higher the failure con- 
dition to which the S had been assigned. The 
task was resumed, repeating the performance 
and shock periods twice for a total of three 
performance periods and three shocks. 

The shock was utilized in the failure stress 
conditions in an attempt to keep motivation 
roughly equivalent for all Ss. A momentary 
impulse shock of 225 v. was found to be fairly 
uncomfortable, but not too disrupting. This 
shock of moderate intensity was used since it 
was felt that an intense shock would destroy any 
effect produced by the failure instructions. The 
Ss might be concerned only with avoiding the 
shock and completely ignore the failure in- 
structions. However, a moderately annoying 
shock would serve to motivate S to perform at a 
high level in order to avoid it, but would not be 
so disruptive as to destroy the effects of the 
failure instructions. 

The Ss scoring in the upper and in the lower 
range on a modified Taylor Scale of Manifest 
Anxiety constituted the high and low anxiety 
groups in the experiment. The modified form 
of the Taylor scale (16) had been administered 


under the innocuous title of a Biographical 
Inventory to 234 students in an introductory 
psychology course at the University of Mary- 
land. The distribution of scores was similar to 
that found by Taylor on a similar population. 
The mean was 14.50, and the distribution was 
positively skewed with a range of scores from 2 
to 28. 

Subjects.—Twenty-seven male students 
making scores of 8 and below, and 27 making 
scores of 21 and above were selected for use as 
the high- and low-anxiety groups. ‘These scores 
were approximately one SD from the mean of 
the obtained distribution. Within each anxiety 
group Ss were assigned at random to the various 
cells of the factorial design. Each S was paid 
$1.35 for his services in the experiment. 

Procedure.—Each S was tested with the 
American Optical Company’s Pseudo-Isochro- 
matic Color Perception Test to insure that he 
was not color deficient. He was seated 2 ft. 
from a large flat black shield, in the center of 
which was a 1 X I}-in. ground-glass screen 
placed at eye level and onto which the colors 
were projected. In front of the shield was an 
Audograph conference microphone into which 
S called out the color names. Leading from the 
lower left side of the shield was a wire with an 
electrode strap. The strap was secured to S's 
left wrist by means of two brass electrodes. 

The S was given preliminary instructions 
about the importance of the task and the 
discriminations and responses that would be 
required ofhim. A practice trial was then given, 
consisting of one run through the 70-unit color 
series. The colors were projected from behind 
the shield through a 1-in. circle onto the ground- 
glass screen. In the practice trial the colors 
were changed manually after S called out the 
name of each color as it was exposed at a self- 
paced rate. When the practice trial was com- 
pleted, each S was given further instructions to 
establish the appropriate failure condition. The 
test trials were then begun. In the test trials, 
the colors changed automatically at the pre- 
determined rate. As S called out the colors, a 
recording of his responses was made. 

Each color frame on the film had a code 
number which was projected onto the ground- 
glass screen. ‘This code number was projected 
outside the aperture through which S was 
viewing the stimuli. Only E could see the code 
numbers, and he whispered every fifth code 
number into a separate microphone. In this 
manner the code numbers were superimposed 
on the record of S’s responses for later tran- 
scription and scoring. 

Immediately after the test trials were com- 
pleted, S was given a Stress Experience Inven- 
tory to complete. This inventory consisted of 
four 7-point rating scales and 25 multiple-choice 
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items relating to the amount of stress S recalled 
having experienced during the experimental 
session. This inventory was introduced to see 
if there was a relationship between subjective 
feelings of stress and both performance and the 
conditions of stress in the experiment. No 
further verbal instructions were given until after 
the inventory was completed. The £ then 
described the true nature of the experiment to 
S and indicated that he had actually done well. 
This was done in order to remove any possible 
effects of the imposed failure conditions. 


ResuLts AND Discussion 


The transcribed performance rec- 
ords for each S were scored in terms 
of errors of omission (blocking), 
errors of commission (incorrect re- 
sponse), and total number of correct 
responses. Preliminary analyses of 
these scores indicated that errors of 
commission were relatively infrequent 
and were less sensitive to variations 
in the experimental conditions. ‘Total 
correct responses merely represent the 
complement of the sum of the two 
types of errors. Errors of omission 
no response to a given stimulus——were 
used in the major analyses of results. 


TABLE 1 


Anatysis oF Variance: TRANSFORMED 
Erxronr Scores Arrer WARNING 
or PorentiaL Faiture 


Source Mean | df | MS | FP 
Speed (S) 2 
Slow 12.65 
Medium 18.79 
Fast 24.11 | 
Failure (F) 
Slight 
Moderate 
High 
Anxiety (A) 


591.26 38.07t 


| 17.22 | 

20.50 

17.82 | | 
34.18) 2.20 

17.72 

19.31 | 

610.00 | 39.32t 

| 66.83) 4.30°* 

| 77.82) 5.01% 

15.53 

Total 


*P < OS 

—-P< Ol 

+P < OO. 

+ Error term is pooled insignificant interactions 


TABLE 2 


Anatysis or Variance: ‘TRANSFORMED 
Error Scores Arrer Snock 
InpicaTION oF FatLure 


| } 
Source | Mean dj | MS I 
Speed (S) 
Slow 
Medium 
Fast 
Failure (F) 
Slight 
Moderate 
High 
Anxiety (A) 


400.61 | 16.20** 


5.54 
| 9.34 
| 14.92 | 
8.37 
11.30 | 
10.14 | 
40.98) 1.66 
9.06 
10.81 | 
638.00 | 25.80°* 
| 73.98) 2.99° 
100.18 | 4.05* 
| | 24.73 
Total Bil 


*P < MS. 
-P < OO. 
+ Error is pooled insignificant interactions 


Two separate combinations of the 
errors of omission were made: (a) 
for the three “postbuzzer” periods, in 
which Ss were operating under threat 
of failure; and (b) for the “postshock” 
periods, in which Ss were operating 
under presumed failure. The abso- 
lute scores were converted to per- 
centages, since under different speed 
conditions Ss responded to differing 
numbers of stimuli. The inverse sine 
or angular transformation (5, p. 203) 
was applied to the percentage data in 
order to permit application of analysis 
of variance techniques to the results. 

The results of the analyses of the 
transformed data on errors of omission 
are presented for the two aspects of 
performance: after signal for pos- 
sible failure (Table 1) and after shock 
indication of failure (Table 2). These 
tables also present the appropriate 
means. 


There are certain features of these analyses 
which deserve special Absolute 
replication is dispensed with as a result of 
classifying the three individuals in each cell 
with respect to their rank orders of errors of 
omission. This arrangement, devised by John- 


comment, 
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son and Tsao (8, 9, p. 311), places individual 
order as a main effect. Since there is then no 
independent comparison attributable to experi- 
mental error, the highest order interaction is 
first used to test the main effects and each of the 
other interactions. The nonsignificant inter- 
actions are pooled as the general error term (2, 
8, 9, 15). This general technique permits 
evaluation of the relative significance of indi- 
vidual differences as a separate source of vari- 
ation in the experimental results. It has seemed 
to us that this feature is of some importance in 
considering direct applications of any experi- 
mentally obtained results. The data from the 
Stress Experience Inventory are presented in 
Table 3, In this analysis the more conventional 
form of analysis of variance was used. 

Another and more limiting feature of the 
results in Tables 1 and 2 is the fact that the 
analyses were necessarily performed on trans- 
formed scores. ‘There is common effect between 
the original and the transformed scores, but they 
are not of exactly the same distribution. This 
feature necessarily places restrictions on the 
generalizations that may be made and on the 
interpretation of interaction effects (14, p. 149). 
This situation is typical of experimental data 
of the form used in studies of this type. 


As seen in the analysis tables, the 
experimental conditions clearly appear 
to have produced “‘stress”’ as we have 
defined it. This stress was reflected 
both in the performance and in 
subjective reports. On the color- 
naming task, errors of omission appear 
to be sensitive indicators of stress. 
Task-induced stress and failure stress 
conditions appear to produce effects 
upon performance, the former ex- 
hibiting a monotonic relationship and 
the latter a nonmonotonic relation- 
ship. The results also suggest that 
failure stress may dissipate in time. 
Cognitive processes may intercede to 
reduce the effects of failure stress. 

Regarding the role of manifest 
anxiety, it appears from the results of 
this study that anxiety classification 
as such does not produce differential 
effects on the continuous high-speed 
color-naming performance. How- 
ever, the several significant inter- 
actions obtained between anxiety and 
the experimental conditions indicate 
a possible “priming” function of 


TABLE 3 


Anatysis or Variance: Stress 
Experience InventorY 


Source 


Speed (S) 
Slow 
Medium 
Fast 

Failure (F) 
Slight 
Moderate 
High 

Anxiety (A) 


| 22.68 |10.23** 


8.94 
1.68 
D6 
2.77 | 


2.02 


4.42°* 


4.88* 
1.37 





anxiety for other stress effects. As 
might be expected, anxiety served to 
increase S’s susceptibility to the 
effects of stressful conditions. This 
interpretation is supported by the 
fact that anxiety conditions of the Ss 
produced significant differences on the 
responses to the Stress Experience 
Inventory. These results for the 
inventory serve to substantiate a 
basic assumption upon which the 
study was based, to the effect that 
the experimental conditions imposed 
upon the Ss would serve as stressful 
experiences. 

The highly significant results for 
the individual orders obtained in the 
experiment clearly indicate that there 
are very wide individual differences in 
the performance of the Ss on this 
experimental task. The interactions 
obtained as statistically significant are 
of value here in generating hypotheses 
for further investigation, specifically 
designed effectively to test these 
hypotheses. The speed-failure inter- 
action obtained in the three analyses 
is of particular interest. Examination 
of the tabulated means indicated that 





EFFECTS OF STRESS AND ANXIETY ON COLOR NAMING 17 


high speed and failure stress, espe- 
cially the “moderate failure’ con- 
dition, combined to produce a dis- 
proportionately large number of errors 
of omission, a form of blocking. The 
combined stress here may well be 
producing a characteristic disruption 
which can be tested separately. The 
interactions with the anxiety factor 
are more easily characterized as indi- 
cating a heightened sensitivity to the 
experimental stress on the part of the 
more “anxious” Ss, as indicated by 
analysis of the tabulated means. 


SUMMARY 


This experiment was designed to determine 
the differential effects of anxiety, failure stress, 
and task-induced stress on performance of a 
continuous high-speed color-naming task. A 
total of 54 male undergraduate students were 
selected as Ss on the basis of high and low scores 
on a modified Taylor Anxiety Scale. They 
were randomly assigned within each anxiety 
group to the various cells of a basic 3 X 3 XK 2 
factorial design. ‘This design involved three 
levels of failure stress, three levels of task- 
induced stress, and two levels of anxiety classi- 
fication. The design permitted the assessment 
of individual differences as a separate source of 
variation among the results. Errors of omission 
were used as measures of performance. 

Performance scores were studied for two 
special portions of the experimental data: (a) 
after warning of potential failure and (b) after 
shock indicating failure. A Stress Experience 
Inventory was given to each S after his testing 
session. 

The results show that errors of omission are a 
sensitive indicator of stress effects. Failure 
stress and task-induced stress produce significant 
decrements in continuous high-speed color- 
naming performance and significant effects in 
subjective reports of stressful experience. 
Manifest anxiety does not produce differential 
effects on performance. ‘The several significant 
interactions between anxiety and the various 
experimental conditions indicate a _ possible 
“priming” function of anxiety. Highly sig- 
nificant individual differences were found for the 
task used in the experiment. 
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. ANTICIPATION OF REWARD AS A FUNCTION OF 
PARTIAL REINFORCEMENT ! 


HOWARD BRAND, PAUL J. WOODS, AND JAMES M. SAKODA 


The University of Connecticut 


Brunswik (1) was among the first 
to consider the effect of varying 
probability of reward in a two-choice 
learning task. He considered three 
hypotheses. The first was concerned 
with the threshold of probability 
discrimination, the second with the 
effect of the probability ratio on 
learning, and the third with the effect 
of the probability difference on learn- 
ing. Five conditions were employed: 
100:00, 75:25, 67:33, 100:50, and 
50:00. That is, for instance, in the 
first condition, reward was obtained 
100% of the time at one side of a T 
maze; in the second, the reward was 
obtained 75% of the time on one side 
and 25% on the other, and so on. 
The hypothesis about probability 
ratio is that if the difference between 
the two percentages of reward is held 
constant, for two or more schedules, 
then the level of learning will increase 
as the ratio of the percentages in- 
creases. The probability difference 
hypothesis states that if the ratio is 
held constant, for two or more 
schedules, then the level of learning 
will increase as the difference between 
the percentages increases. 

In Brunswik’s study, the differences 
are held constant as the ratio of 
probability of reward varies in the 
following manner: 100:50, 75:25, 
and 50:00 have a difference of 50. 
The ratio in these varies from 2:1 to 
3:1 to infinity. For the constant 
difference of 50 he found that learning 
improved as the ratio increased. In 


'The present research was conducted under 
grant M-814 from the National Institute of 
Mental Health. 


a similar way, the probability ratio is 
held constant. 67:33 and 100:50 
have a constant ratio of 2:1; the 
difference varies from 34 to 50. Like- 
wise, 50:00 and 100: 00 schedules have 
a ratio at infinity; the difference 
varies from 50 to 100. Here learning 
improved as the difference increased. 

Clearly, these five conditions do not 
permit an adequate test of the hy- 
potheses because there are insufficient 
points along the difference dimension 
and along the ratio dimension. Our 
main purpose in the present experi- 
ment is to rectify this inadequacy, and 
thereby provide a more satisfactory 
test of Brunswik’s formulation of the 
influence of probability of reward on 
two-choice learning.? Specifically, we 
will report on the investigation of the 
second and third hypotheses described 
above. 


Metuop 


General design.—Sixteen reinforcement sched- 
ules were used, as shown in Table 1. They were 
fitted into a double classification factorial design 
with four levels of ratios and four of differences. 
Each of the numbers represents the number of 
reinforcements that were available on each alter- 
native per 100 trials. For example, the rein- 
forcement schedule 40:20 means that the proba- 
bility of reinforcement on the profitable side was 
40 times in 100, and the probability of reinforce- 
ment on the unprofitable side was 20 times 
in 100. 

Within a column the ratio is constant as the 
difference between the two probabilities varies, 
and within a row the difference is constant as 


* This study differs from Brunswik’s in that 
we employed human subjects while Brunswik 
used animals. ‘The reason for this change is that 
this study is part of a larger program on human 
discrimination learning. We know of no reason 
why these same predictions should not hold for 
humans as well as animals. 
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the ratio varies. Each of the 16 reinforcement 
schedules was assigned randomly to our Ss. 
Since there were five replications for each con- 
dition, the number of Ss was 80. The Ss were 
run on 120 acquisition and 60 extinction trials. 

Subjects.—The Ss were 80 undergraduate vol- 
unteers from courses in psychology at The Uni- 
versity of Connecticut. There were 55 females 
and 25 males. 

A pparatus.—The apparatus consisted of two 
levers (microswitches) at table height in front 
of S and a vertical panel containing a ground- 
glass screen at eye level and a green signal light. 
Red and white bulbs were enclosed in a box 
behind the vertical panel and flashed on the 
ground-glass screen when S pressed the levers. 
The levers were controlled independently by £ 
through a switching set-up so that each lever 
could give either color. By instruction, the red 
light was selected as the reward. ‘The reinforce- 
ment was contingent upon S’s choice of a lever 
following a given probability schedule. 

Switches on £’s control panel enabled him to 
set up one of four possible situations before each 
response of S. That is to say, the apparatus 
could be set to give red on the left lever and 
white on the right lever, or the reverse of this, 
or red on both levers, or white on both levers. 
When E had the switches set, he flashed the 
green signal light on S’s panel, and S pressed a 
lever. A light on £’s control panel told him 
which lever S had pressed. We therefore knew 
the probability of a red light on each side accord- 
ing to the predetermined probability schedule, 
the lever that S pressed on each trial, and 
whether or not reward was obtained on each 
trial. 

Instructions.—After S was seated comfortably 
in front of the apparatus, the following instruc- 
tions were given: 

“The two levers in front of you are merely 
switches which flash either red or white lights 
on the ground-glass screen. Your task is to try 
to get as many red lights as you can. But this 
lever (pressing the one set to give red) will not 
always be red; sometimes the other lever will be 


TABLE 1 


ScuepuLes or Reinrorcement CLASSIFIED BY 
Ratios anv Dirrerences 


Ratios 
Differ 


ences 


3: 6:1 


50 50 | 75:2 
40 40) @&:2 
30 30 | 45: 
20 :20 | 30: 


0:10 
45:8 
36:6 
24:4 


» 7m 
co °o 
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Fic. 1. Percentage of choices to profitable side 
as a function of probability difference. 


red. The red light will be manipulated back 
and forth from trial to according to a 
scheme or plan. On a particular trial, then, one 
lever will be correct and one incorrect. If you 
press the correct lever, a red light will flash and 
if you press the incorrect lever, a white light will 
flash. Your task then is to try to figure out the 
scheme or plan so that on each trial you can 
press the one lever that will give you a red light. 

“When the green signal light flashes, you are 
to press one lever and one lever only. Then 
wait—the green signal light will flash again 
this will be the next trial—you then press the 
lever that you think will be red this time. 
Remember, each trial you press only one lever, 
and you press the lever that you think will give 
you a red light. We can't stop once we have 
started, so do you have any questions now?” 

If there were any questions, E paraphrased 
the relevant part of the instruction, or refused to 
give additional information until the end of the 
experiment. Occasionally an S would interrupt 
in the middle of the experiment. When this 
occurred E would say, “Do the best you can.’ 


trial 


Resutts anv Discussion 


The percentage of choices to the 
most profitable side were plotted for 
blocks of 10 trials for the acquisition 
and extinction record of each S. A 
mean value was obtained for the five 
Ss in each cell (i.e., under each 
schedule of reinforcement) for the 
last four blocks of acquisition in order 
to obtain a measure of the asymptotic 
level. These values may be plotted 


* Following the convention set up by Estes 
and Straughan (4). 
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in two ways. Figure 1 shows the 
differences along the abscissa and the 
percentage of choices to the most 
profitable side on the ordinate. The 
four curves represent groups having 
different ratios, i.e., the four columns 
in the design. The curves are quite 
similar, indicating that the ratios have 
little influence in producing differences 
in asymptotic level. Figure 2 shows 
these same values plotted with the 
ratios along the abscissa and the 
percentage of choices to the most 
profitable side on the ordinate. The 
four curves represent groups with 
varying differences, i.e., the four rows 
in the design. Here the curves are 
quite different, indicating that the dif- 
ferences influence the asymptotic level. 

A double classification factorial 
design analysis of variance was also 
executed on these data. The results 
are shown in Table 2. Here the 
findings on the figures are borne out. 
The only significant variance is that 
due to the differences in frequency of 
reinforcement (P < .01). 

An analysis of the effects of differ- 
ences wnd ratios on rate of learning 
was also undertaken. The rate of 
learning was measured by linear 
regression lines fitted to the first five 
trial blocks. The assumption of line- 
arity does not seem unwarranted as 


% TO PROFITABLE SE 
3 
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Fic, 2. Percentage of choices to profitable side 
as a function of probability ratio. 


TABLE 2 


Anatysis or Variance or Asymptotic Levets 


Mean 


Source df | Square F 





1 3 | 2356.88 | 19.59° 
2. Ratios 3 95.62 | 0.79 
31%K2 9 | 110.14 | 0.92 
4. Within 64 | 12031 


. Differences 





5. Total | 79 | 


*P < OO. 


evidenced by the empirical curves 
plotted in Fig. 3. The analysis per- 
formed with the regression data was 
an analysis of the difference between 
regressions as outlined by Snedecor 
(5). First of all a test was made of 
the significance of the average re- 
gression to determine whether, in 
general, the rate of learning was 
significant or not. Following this 
the analysis consisted of calculating 
deviations from average regression 
and deviations from individual re- 
gression and testing the mean square 
for the difference between the two 
against an error term. The error 
term in each case is the mean square 
of deviations from individual regres- 
sion. The individual and average 
regression coefficients are shown in 
Table 3. The average regression for 
the table as a whole is significant at 
the .O1 level. The regression coeffi- 
cients for the four rows are signifi- 


TABLE 3 


Recression Coerricients ror First Five 
Biocks or Acquisition TRIALS 


Ratios 





a1 | 6:1 | anes | Mean 
2.6 2.0 | 36 4.10 
14 | 64 | 64 | 3.80 
0 | 24] 18 | 1.30 

0.8 |-04 | -0.30 
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Fic. 3. Percentage of choices to profitable 


from each other 
An inspection of Table 
3 shows that as the difference in- 
creases the regression coefficients of 
these lines increase. This indicates 
that as the difference between the 
percentages of reinforcement increases 
the rate of learning likewise increases. 

The regression coefficients for the 
four columns are not significantly 
different from each other. An in- 
spection of Table 3 shows that as the 
ratios vary the regression coefficients 
do not vary in any consistent manner. 
This indicates that a change in the 
ratios of the percentages of rein- 
forcement does not alter in any 
consistent manner the rate at which 
learning will take place. 

An analysis similar to that just 
outlined was performed on the ex- 
tinction data. Regression coefficients 
were calculated for the last block of 


cantly different 
(P < Ol). 
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side as a function of trials (in 10-trial blocks). 


acquisition and the first three blocks 
of extinction trials. The individual 
and average regression coefficients are 
shown in Table 4. The average of the 
regression for the table as a whole is 
significant (P < Ol). The regres- 
sion coefficients for the four rows are 
also significantly different from each 
other (P < 01). An inspection of 
Table 4 shows that as the difference 


TABLE 4 


Recression Coerricients ror Last Brock or 
Acquisition and First Turer Biocxs 
or Extincrion TRiats 


Ratios 
Differ 


ences 


| ; 
| : | Mean 
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increases the regression coefficients 
increase in a negative direction. This 
indicates that as the difference be- 
tween the percentages of reinforce- 
ment increases the rate at which 
extinction takes place likewise in- 
creases. 

The regression coefficients for the 
four columns are not significantly 
different from each other, even though 
there is a nonsignificant tendency for 
the regression coefficient to decrease 
as the ratio increases. 


It may be stated, therefore, that the 
rate of learning, the final level of learning, 
and the rate of extinction vary with the 
difference of probability of reward but 
not with the ratio of probability of 
reward. 

These findings clearly disconfirm the 
hypothesis that acquisition of a two- 
choice discrimination response is a func- 
tion of the ratio of the probability of 
reward when the difference of the 
probability of reward is held constant. 
We do, however, obtain the result that 
acquisition and extinction of the response 
is dependent upon the difference in the 
probability of reward. Here, Brunswik 
(1, p. 185) is confirmed. The absence of 
any interaction between difference of 
probability and ratio of probability 
further indicates the relationship is 
unequivocal, 

For the range of differences in reward 
sampled here, we have found no indi- 
cation of a threshold value for discrimi- 
nation of probability of reward. If a 
threshold had been obtained, then there 
would be an interaction between ratio of 
reward and difference of reward. This 
did not occur. Learning and extinction 
in this experiment were always a direct 
function of the increasing difference of 
reward on one side of the discrimination 
versus reward on the other side.* 


4 These results may be interpreted in terms of 
a statistical learning model (3). This model, 


SUMMARY 


The purpose of the experiment was to investi- 
gate the effect of probability of reward in a two- 
choice learning task. ‘Two conditions, proba- 
bility ratio of reward and probability difference 
of reward on the two choices, were used. These 
were fitted to a double classification factorial 
design with four levels of ratios and four of 
differences. Each of the 16 schedules were 
randomly assigned to human Ss (N = 80). One 
hundred and twenty acquisition and 60 extinc- 
tion trials were run. 

The data on rate of learning, final level of 
learning, and rate of extinction were subjected to 
an analysis of variance and to a regression 
analysis. Both analyses show that the per- 
formance measures varied positively with 
probability difference of reward, but showed no 
consistent variation with probability ratio of 
reward. 

These findings only partially confirm 
Brunswik’s original hypotheses on probability 
learning. 
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however, as Brunswik (2) has already pointed 
out is, in principle, different from what he would 
propose as the role of probability in learning 
because the experimental designs lack repre- 
sentativeness. If representativeness is an em- 
pirical issue, however, it would be expected that 
where it has been built into an experiment, as it 
was in this study, there should be confirmation 
of Brunswik’s theoretical position. Unhappily, 
the contrary was found to be the case. 
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EFFECT OF DIFFERENT PERCENTAGES OF MONEY 
REWARD ON EXTINCTION OF A LEVER-PULLING 
RESPONSE ! 


DONALD J. LEWIS AND CARL P. DUNCAN 


Northwestern University 


The purpose of this study was to 
determine the effect of different 
percentages of reward upon extinction 
of a lever-pulling response. The 
study differs from most others in the 
area of partial reinforcement in three 
principal ways. First, a larger num- 
ber of different percentages of rein- 
forcement is used than have prev- 
iously been employed. Second, the 
reward used (money) was highly 
meaningful to the human Ss. And, 
third, the task used—playing a slot 
machine——was one that had _ con- 
siderable intrinsic interest for the Ss. 

One of the more reliable “laws” of 
experimental psychology is that con- 


cerning the effects of partial rein- 
forcement which holds that a partially 
reinforced response is more resistant to 
extinction than a continuously rein- 


forced one. This law has been dem- 
onstrated to hold for a considerable 
number of stimulus situations and 
response systems (3). Attempts to 
determine the extent of application of 
the law of partial reinforcement are 
important for at least two reasons: 
(a) theorists (1) have made important 
extensions of behavior-theory prin- 
ciples, admitting that the generality 
of these principles needs to be deter- 
mined experimentally, and (b) study- 
ing stimulus situations and response 
systems different from those com- 
monly used may well yield informa- 

1 This study was supported in part by a grant 
from the National Science Foundation. The 
authors wish to thank S. A. Kendrick, C. Meyer, 


J. Mintz, and E. Lindquist, for their assistance 
with this experiment. 
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tion about the 
reinforcement. 


nature partial 


Metuop 


Apparatus.—The apparatus was a standard 
25-cent slot machine (“one-armed bandit”) 
which had undergone modification for the 
purposes of this research. The machine was 
played with chromium-plated steel discs which, 
upon being placed by S in the coin slot, even- 
tually found their way to the cash drawer of the 
machine. The cash drawer contained almost 
a thousand discs, and constituted the supply 
reservoir upon which S drew when he needed 
more discs for his play. 

The S was paid off from a separate supply of 
discs which were stored in a tube that led directly 
to the payoff tray on the front of the machine 
Release of a disc from the tube into the payoff 
tray was accomplished by activation of a trigger 
attached to the tube. The trigger was tripped 
by brads held in holes drilled in the perimeter of 
a wheel which was mounted inside the machine 
The E determined beforehand which plays would 
result in payoffs simply by inserting brads into 
holes in the wheel. 

A counter mounted inside the machine 
cumulated the total number of plays for each S. 

Since the machine was played with discs, 
rather than quarters, and since S was never 
rewarded with more than one disc even on a 
payoff play, the 25¢ symbol and the plaque 
describing the original payoff system, both of 
which appeared on the front of the machine, 
were covered with masking tape. ‘The pictures 
of the various fruits and other objects, which 
revolved behind the glass window with each play, 
were unchanged, Except for the masking tape 
and the use of discs, the machine looked and 
sounded like an ordinary slot machine. 

Design.—A total of seven groups of 50 Ss 
each were run, each group having an 


“acqui 
sition” and 


“extinction” The ac 
quisition series, for each group, consisted of the 
first eight plays. After the acquisition series 
the extinction series began; no S ever won after 
the eighth play. 

It was thought that perhaps pattern of rein- 


an scries 
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forcement might be important, especially with 
low percentages where pattern can vary a good 
deal within a percentage. So a pilot study on 
patterns was carried out, using 25% reinforce- 
ment. Four groups, 25 Ss each, were paid off 
on the following patterns: ‘Trials 1 and 8, 3 and 
8,5 and 8, and 7 and 8, The mean log plays to 
extinction for these groups did not differ sig- 
nificantly; F was less than one. ‘Therefore, the 
pattern for the other partial reinforcement 
groups was chosen rather arbitrarily according 
to the criteria described below. 

Payoff trials were selected to be irregular 
insofar as possible within any percentage group, 
and so that if any payoffs were to be given at all, 
one would be on Trial 8. This made it mean- 
ingful to talk about an 8-trial acquisition series. 
The plays that resulted in a reward are shown 
by Table I, 

Assignment of Ss.—The Ss were 350 men and 
women from the introductory psychology classes 
at Northwestern University, The experiment 
was run in two phases, Assignment of Ss for 
Phase 1 was as follows: The 100% and 50% 
groups first received a complement of 30 Ss. 
Then the 25% and 0% groups each received 30 
Ss, and finally 30 Ss were assigned to the 124% 
374%, and 75% groups. Phase 2 of the experi- 
ment then began. ‘Twenty additional Ss were 
assigned to each of the seven groups in turn as 
Ss came to the laboratory. ‘The Ss for the total 
experiment were drawn from six different 
academic quarters (and from all seasons of the 
year). 

Because of the method of assignment used 
during Phase 1—a method made necessary by 
practical considerations—there was a possibility 
that group differences were caused by assignment 
rather than by the experimental variable. To 
determine whether this happened, data from the 
two phases were analyzed separately. 

Instructions.—The instructions, which were 
spoken to S from memory, were as follows: 


“Have you played a slot machine before? 
{If S says, “yes” ] Then you know there is 
nothing complicated about it. [If S says, 
“no” ] Well, it’s very simple; I'll show you, 
You play the machine with these discs out of 
the drawer back here. [About 15 discs are 
taken from the cash drawer while S watches. ] 
Whenever you want more discs to play with, just 
take them out of this drawer; then push the 
drawer back in. [E£ pushes drawer back.] 
You play this machine in the usual way. Just 
put a disc in the slot and pull the lever all the 
way down. The idea is to win as many of these 
discs as you can. The ones you win will fall 
into this tray here, and later I'll pay you a 
nickel for each one you win. Don’t use the 
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ones you win to play with. Whenever you want 
more to play with, just take them out of the 
drawer. You can play as long as you want. 
Just keep taking discs out of the drawer and 
play until you decide to quit. Then bring over 
what you have won and cash them in. I won’t 
answer any more questions now or while you 
are playing, but when you quit, bring over the 
discs you have won, cash them in, and we will 
talk about it at that time. One other thing. 
You can’t influence any slot machine by how 
hard you pull the lever, so just give it a normal 
pull all the way down. Remember, you can 
play as long as you want. Okey, begin when 
you are ready,” 

Procedure-—All Ss were run individually. 
Preliminary experimentation indicated that a 30- 
min, experimental period was sufficiently long to 
allow S to play as long as he wanted. Only 
two Ss would have exceeded this time limit if 
allowed to continue; their data were discarded. 

While S played E sat reading at a table in 
the same room, When S was through playing, 
he was given a nickel for each disc that he had 
won. He was then told that he had participated 
in a study on incidental learning, and was asked 
if he could remember what combinations of 
fruit pictures had paid off, which ones had not, 
which ones came up most often, etc. He was 
then asked not to tell anyone what the purpose 
of the study was. It was hoped that this would 
prevent the true nature of the experiment from 
being known by Ss before and after they played. 
There was no evidence to indicate that this was 
not the case. 


Resutts anp Discussion 


The raw score for each S was simply 
the total number of plays (including 


the eight acquisition plays). Because 
the distributions of raw scores for all 
groups were positively skewed, the 
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scores were transformed into logs 
which resulted in a satisfactory nor- 
malization. 

Since the experiment was run in 
two phases, as described earlier, it 
was necessary to determine if there 
were phase differences that affected 
the experimental group differences. 
The F less than 1 (see Table 2) 
attributable to the phases indicates 
that the assignment of Ss did not 
result in significant differences, and 
the interaction F, again less than lI, 
indicates that the phase assignment 
did not interact significantly with the 
major variable. 

In Table 2 the F of 4.41, which is 
significant beyond the .O1 level, indi- 
cates that the percentage of reinforce- 
ment had a very significant effect 
upon total number of plays. This 
effect is shown graphically in Fig. 1. 
It seems clear that the persistence of 
the lever-pulling response is an inverse 
function of the percentage of reward. 
That partial reinforcement is effective 
in increasing the resistance to extinc- 
tion of the response as compared to 
continuous reinforcement is indicated 
by the ¢ of 3.72 for the comparison 
between the 100% group and the 
pooled partial reinforcement groups. 
It also appears that no reinforcement 
at all increases the total number of 
plays as compared to 
reinforcement (t = 6.12). 


continuous 


Grant and Schipper (2) determined the 
effectiveness of 0%, 25%, 50%, 75%, and 
100% reinforcement on the resistance to 
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Fic. 1. Mean log trials to extinction as a 
function of the percentage of trials reinforced 
during acquisition. 


extinction of a conditioned eyelid re- 
They found that resistance to 
extinction was greatest in the 50°], and 
75% groups, falling off somewhat in the 
25% and 100% groups. The 0% group 
showed very little initial conditioning 
and consequently no resistance to ex- 
tinction. Grant Schipper 
preted their results in terms of two 
processes. The first had to do with the 
habit strength at the beginning of ex- 
tinction. If the habit is weak it will 
extinguish quickly, even though it was 
acquired under a partial reinforcement 


sponse. 


and inter- 


schedule. Grant and Schipper stated 
that the 25% reinforcement schedule 
resulted in inferior resistance to extin« 
tion because it in all probability gives 
barely sufficient reinforcement for the 
response to appear consistently during 
acquisition. The second process they 
describe as the ease of discriminating the 
acquisition from the extinction 
series, and they believe that this dis- 
crimination is a function of the similarity 
between acquisition and extinction. 
Putting these two processes together, 
the greatest 
should occur 
strength is relatively large and dis 
crimination difficult. This maximum 
occurred somewhere between 50°, and 
75% reinforcement in the and 
Schipper situation, 


serics 


resistance to 
both 


extinction 


where response 


Grant 
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Our results can perhaps be interpreted 
using only the second of these two 
processes. It seems likely that the 
schedules of reinforcement had no effect 
upon response strength, at least in the 
sense of bringing about differential 
habits, for the response used in this 
situation. The response of lever-pulling 
is a relatively simple one, and one that 
gets practiced sufficiently in a variety of 
situations during the life of a young 
adult to be at maximum strength in all 
of our Ss. The few additional trials of 
the acquisition series, therefore, probably 
did not increase the habit strength of any 
of the Ss. These acquisition trials did 
have an effect, however, in setting the 
acquisition series apart from the extinc- 
tion series, the second of Grant and 
Schipper’s two processes. 

Because continuous reinforcement is 
most unlike continuous nonreinforce- 
ment, the discrimination between ac- 
quisition and extinction should be easiest 
for the 100% Ss and they should quit 
first when extinction starts. Ease of 
discrimination should be next greatest 
for the 75% group, then for the 50% 
group, and so on through the 124% 
group. Using only the ease of dis- 
crimination process, a general inverse 
relation should result between percentage 
reinforcement and resistance to extinc- 
tion, such as shown in Fig. 1. 

The 0% group of the present experi- 
ment is somewhat unusual in that it does 
not represent a response of weak or no 
habit strength as it does in the Grant and 
Schipper experiment and in most experi- 
ments where it has been used. The 
discrimination process, however, seems 
helpful in explaining the results from this 
group also. Because continuous non- 
reinforcement is identical with further 
continuous nonreinforcement, the dif- 
ference between the first eight plays and 
the succeeding plays is negligible, making 
an extinction difficult to 
criminate. 

Lewis (4), using 7-yr.-old boys as Ss 
and button-pushing response, found that 


series dis- 


50% and 60% reinforcement resulted in 
superior resistance to extinction over 
100%, but that the partial percentages 
also resulted in superior resistance to 
extinction over 0%. The difference 
between 0% and 100% was in favor of 
the 0% group but was not significant. 
A graph of his results yields a curve much 
like that given by Grant and Schipper 
instead of the nearly linear trend of the 
present experiment. Perhaps for such 
young Ss, with restricted learning op- 
portunities, we must also utilize the first 
of the two processes. Assuming the 
button-pushing response to be below 
maximum strength in the young boys, 
the response would be increased as a 
function of the number of reinforcements. 
The effects of reinforcement together 
with the process of discrimination could 
result in a maximum at about 50% to 
60% reinforcement. 

It could also be argued that the sched- 
ule of reinforcement may affect the 
motivation of Ss. If one considers the 
general motivation of Ss in this experi- 
ment to be the desire to win, then this 
motivation would be reduced most by 
100% reinforcement, less by 50% rein- 
forcement and least by 0% reinforce- 
ment, again giving us a trend between 
percentage of reinforcement and number 
of plays to extinction. 


SUMMARY 


The purpose of this study was to determine 
the effect of seven different percentages of 
reward upon extinction of a lever-pulling re- 
sponse. An ordinary slot machine was modified 
so that E could determine both the total number 
of payoffs (rewards) and the exact trial on which 
payoff would be made. Seven groups of fifty 
human Ss each were run under different per- 
centages—100%, 75%, 50%, 374%, 25%, 124%, 
and O0%—of reward. The acquisition series 
was the first eight plays, after which no S ever 
won again. The response measure was the 
total number of plays prior to S’s decision to 
quit. There was a general inverse relation 
between percentage of reinforcement and 
number of plays to quitting. The ease of dis- 
criminating between acquisition and “extine- 
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tion” was believed to play an important role 
in determining the inverse relation. 
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AN ANALYSIS OF THE EFFECT OF NONREINFORCED 
TRIALS IN TERMS OF STATISTICAL 
LEARNING THEORY 


RICHARD C. ATKINSON ! 


Indiana University 


It is the purpose of this study to 
investigate the effect of nonreinforced 
trials in a Humphreys-type “verbal 
conditioning” situation. Before 
specifying the problem, however, a 
synopsis of both the situation and the 
theoretical orientation to be employed 
will be presented by referring to an 
article by Estes and Straughan. 

We consider a situation in which 
each trial begins with presentation of 
a signal, or CS; following the signal, 
one or the other of two reinforcing 
stimuli, E, or Ey, occurs, the proba- 
bility of FE, and EF, during a given 
series being and 1 — , respectively. 
The behaviors available to S are 
categorized into two classes, A, and 


In the 


Ax, by experimental criteria. 
verbal conditioning situation, A, is a 
prediction that F, will occur, and A, 
a prediction that EF, will occur on the 


given trial. We assume that the CS 
determines a population, §,, of stimu- 
lus elements which is sampled by S on 
each trial, the proportion @ of the 
elements in this population consti- 
tuting the effective sample on any one 
trial. The dependence of S’s re- 
sponses upon the stimulating situation 
is expressed in the theory by defining 
a conditional relationship such that 
each element in S, is conditioned to 
(tends to evoke) either A, or Ay. In 
order to interpret the formal model in 
terms of a verbal conditioning experi- 
ment, we assume that when an E, 
occurs it evokes from S a response 


' The author wishes to thank Professor W. K. 
Estes for advice and assistance in carrying out 
this research. 


belonging to class Aj, i.e., one which 
is compatible with the response of 
predicting E, but which interferes 
with the response of prediction Eo», 
and that when an E, occurs it evokes 
a response of class Ay. Then on a 
trial on which E, occurs we expect on 
the basis of association principles that 
all elements sampled from S, on the 
trial will become conditioned to Aj, 
while on an FE; trial the sample will 
be conditioned to A». Now if suc- 
cessive trials are sufficiently discrete 
so that samples from §, are statis- 
tically independent, the probability 
of an A, after Trial n, abbreviated 
p(n), is defined in the model as the 
proportion of elements in S, that are 
conditioned to Ay, and similarly for 
the probability of an A», [1 — p(n) ] 
(2, p. 225). 

An interesting problem arises when 
one modifies this situation by intro- 
ducing trials on which neither FE, nor 
E. occurs, that is, trials on which S is 
given no information as to whether or 
not his predictive response (A, or A:) 
was correct. What prediction would 
one venture concerning the effect of 
such nonreinforced trials on the 
probability of an A, or A, response on 
subsequent trials? This paper pro- 
poses and examines the following 
three hypotheses which are stated in 
terms of statistical association con- 
cepts (1). 

Hypothesis I, (H-I).—Nonreinforced trials 
have no effect on the hypothetical connection 
between stimulus elements and response classes. 


Elements sampled from the S, population on such 
trials are returned to the population at the end 
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of the trial with their conditional status un- 
changed. That is to say, if an element is 
sampled on a nonreinforced trial, it will remain 
connected to the same response class that it 
was connected to before the start of the trial. 

Hypothesis 11, (11-11).—In the absence of the 
E, and E, events, S’s overt predictive A; or Az 
response is viewed as a terminal response for the 
trial, and it is posited that elements sampled on 
such nonreinforced trials are conditioned to the 
A, or Az response emitted on the trial. This 
notion is similar to Guthrie’s assumption (3) that 
the last response emitted in the presence of a 
stimulus pattern is the one which will tend to be 
produced when the stimulus pattern recurs. 

Hypothesis 111, (H-11T).—Elements sampled 
on a nonreinforced trial are connected to the A, 
response class with probability a, and to the Ag 
response class with probability 1 — o. We shall 
only consider the case where 0 = 1 —o@ = 5, 
Here we assume that, in the absence of infor- 
mation as to which response was correct, there 
exists an equal likelihood that sampled elements 
will be connected to either response class. 


In order to evaluate these hy- 
potheses we shall use an approach 
similar to that employed by Estes 
and Straughan (2). For any schedule 
of E, and EF; trials they were able, by 
means of statistical learning theory, 
to predict asymptotic response proba- 
bility. Similarly, for any schedule of 
F,, FE. and nonreinforced trials we 
can predict asymptotic response 
probability under the conditions of 
each of the hypotheses stated above. 
The problem then of defining the 
effect of nonreinforced trials on be- 
havior can be formulated in terms of 
the long run effect of schedules of 
reinforced and nonreinforced trials on 
asymptotic response probability. 

There is, however, the problem that 
not all schedules of F,, E., and non- 
reinforced trials will yield differential 
predictions for the three hypotheses. 
For example, in the situation em- 
ployed by Estes and Straughan (2), 
it can be shown that the inclusion of 
nonreinforced trials with any fixed 
probability in the schedule of E,’s 
and E,’s would yield identical asymp- 


tote and slope predictions (a) for 
H-I and H-II for all values of r and 
(b) for H-I, H-II, and H-III for 
w= 5. 

Consequently, a schedule of events 
is required which will yield differential 
predictions for the three hypotheses. 
We have selected the following sched- 
ule which, with the substitution of 
appropriate experimental parameter 
values, meets this requirement: 


1. If an A; occurs on Trial n, there 
is: (a) a probability Bw that an FE, 
will occur, (b) a probability 6(1 1) 
that an EF, will occur, (c) a probability 
(1 — 8) that neither an F, nor EF, will 
occur. 

2. If an Az occurs on Trial n, there 
is: (a) a probability # that an FE, 
will occur, (6) a probability (1 — ) 
that an FE, will occur. 


By methods of finite calculus we 
can describe the expected asymptotic 
probability of an A, response, denoted 
by p(# ), for the above schedule under 
the conditions of each hypothesis. 


By H-I, p(@) is bounded by 
, 


e+ A(l — wr) 


x and 


By H-II, 

——— ee 

AO PST al - of 
By H-III, 
rT 


—_ = ’ 3 
sao mo-e 


pla) = 4 


The derivations of these results are 
given in the next section. 


MATHEMATICAL FORMULATION 


For the schedule specified above, the proba- 
bility of an A, response after Trial n, p(n + 1), 
is: 
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By H-I, 


p(n + 1) = (1 — O)p(n) + O[[1 — p(n) 
x [r(l) + (1 — w)(0)] 
+ p(n)Alr(1) + (1 — w)(0)J 
+ p(n)(1 — B)p(n)}. (4) 


The proportion (1 — 6) of the elements is not 
sampled and remains unchanged. The pro- 
portion 6 of the elements is sampled and con- 
ditioned as follows: (a) If an A, occurs, with 
probability [1 — p(n)], there is a probability = 
that all sampled elements will be connected to 
A,, and a probability (1 — w) that the elements 
will be connected to Az; (b) If an A, occurs, with 
probability p(n), there is a probability Bw that 
all sampled elements will be conditioned to Au, 
a probability 6(1 — w) that the elements will be 
conditioned to A,, and a probability (1 — @) 
that the conditional status of the elements will 
remain unchanged. 


Rewrite Equation 4 


p(n + 1) = (1 — O p(n) + OfA(n) 
+ p(n)*(1— B)] (5) 
where 
A(n) = [1 — p(n) Jw + p(n)Br. (6) 
By H-Il, 


p(n + 1) = (1 — O)p(n) + O[A(n) 

+ p(n)(lL—6)}. (7) 
This equation is isomorphic with Equation 5 
except for the final product of the last term. If 
an A, occurs and is not followed by an EF, or Es, 
this joint event having probability p(n)(1 — 8), 
then all elements will be conditioned to the 
response emitted on the trial, namely, Aj. 


By H-III, 


p(n + 1) = (1 — O)p(n) + OLA(n) 
+ p(n)(1 — B)(.5)]. (8) 

This equation is also isomorphic with Equation 
5 except for the final product of the last term. 
If an A; event occurs and is not followed by an 
E, or Ey, then there is a probability .5 that 
sampled elements will be connected to Ay. 

Taking the expected value of Equation 7 and 
solving the resulting equation (4) we obtain 


pin) = pl~) — [p(@) — pO) 
x [(l1-—@+e01—,6)(1—-)}*, (9) 


where p() is given by Equation 2. 
Similarly for Equation 8 


pin) = p(@) — [p(~) — pO] 
x (1 ~ @+0(1 — 8)(.5 — x))*, 


where pl #) is given by Equation 3. 


(10) 
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Equation 5 is a second-order difference 
equation, and no method is known for effecting 
a solution. However, by taking the expected 
value and letting n —+ « we obtain 


r+il — B)p(=)? 


( = 
Ae) 1 + w(1 — 8) 


(11) 
An upper and lower bound can now be calculated 
for pi 2) by letting the variance of p(#) assume 
a minimum value of zero and a tsaximum of 
p(1 — p). When variance is minimal p* = 7’, 
when maximal p = p*. ‘These substitutions in 
Equation 11 yield boundary conditions on p(#) 
given by Equation I. 


Metuop 


Design.—Two groups, A and B, were neces- 
sary for good differentiation between the 
hypotheses. For Group A the value of + was 
8125; for Group B the value of r was .1875. 
The value of 8 was .25 for both groups. 

Apparatus.—The E and S sat on opposite 
sides of a table with a large opaque screen 
separating them. The apparatus, as viewed 
from S’s side, consisted of two telegraph keys 
mounted 24 in. apart at right angles to the base 
of the screen; upon the screen were mounted 
three milk-glass panel lights. One of these 
lights, which served as the signal for S to guess, 
was centered between the telegraph keys at 
S’s eye level. Each of the two remaining lights, 
the reinforcing signals, was mounted directly 
above one of the telegraph keys. The pres- 
entation and duration of the lights were 
automatically controlled, and on any trial the 
reinforcing signal could be withheld. 

Subjects.—The Ss, 52 students obtained from 
beginning courses in psychology during the fall 
semester of 1954, were assigned at random to 
experimental groups. 

Procedure.—After being seated in front of the 
screen S was read instructions, similar to those 
reported by Estes and Straughan (2), requesting 
that he respond to the signal light on each trial 
by operating the telegraph key corresponding to 
the reinforcing light that he expected to follow. 
For each S, one of the two reinforcing lights was 
randomly designated as E,, and the other as F2. 
After the instructions were read, four practice 
trials were given with the lights presented in the 
order Es, F;, Ei, Es. After these trials £ cor- 
rected any apparent errors in procedure and 
reviewed the instructions. 

Following the practice trials, 144 trials were 
given each S. These trials were run off in 
continuous sequence. For each S a sequence 
of FE, Es, and nonreinforced trials was generated 
in accordance with the assigned values of # and @. 
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On each trial the signal light was lighted for 
2.5 sec.; the time between successive signal 
exposures was 7.5 sec. On reinforced trials the 


reinforcing light followed the cessation of the 


signal light by 2 sec. and remained on for 1 sec. 


REsuLtTs AND Discussion 


Substituting values of w and B in 
Equations 1, 2, and 3, p(~) can be 
described under the conditions of each 
hypothesis. 


Predictions for Group A are 

1. By H-I, p(*) is bounded by 
8125 and .9455. 

2. By H-II, p(*) = .9455. 

3. By H-III, p(@) = .6582. 


Predictions for Group B are 


1. By H-I, p(*) is bounded by 
1875 and .4800. 

2. By H-II, p(~) = 4800. 

3. By H-III, p(*%) = .2449. 


Taking the proportion of A, re- 
sponses over the last two blocks of 
16 trials as an estimate of p(*”) we 
can compare these values with the 
above set of predictions. By in- 
spection of Fig. 1 we see that for 
Group A the proportions for the last 
two trial blocks are .828 and .842, 
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Fic. 1. Empirical and theoretical curves 
representing mean proportion of A, responses per 
16-trial blocks. 


with a mean of .835. To determine 
how well this value approximated the 
predictions of H-II and H-III, ¢ tests 
were run between the predicted value 
and the proportions computed for 
individual Ss over the last 32 trials. 
Theo, was .O17. The difference 
between the predicted value and the 
mean of the observed values was .110 
for H-II and .175 for H-III; both 
differences are significant at the .05 
level. By inspection of Fig. 1 we see 
that for Group B the proportions for 
the last two trial blocks are .315 and 
.337 with a mean of .326. The a,, 
021. The differences between 
predicted and observed values were 
.154 and .O81 for H-II and H-III, 
respectively; both are significant at 
the .05 level. 

In view of these results, it is clear 
that neither H-II nor H-III accounts 
for the obtained What then 
can be said concerning H-I? First 
of all, the proportions computed over 
the last 32 trials for both groups fall 
within the bounds for p(«) specified 
by H-I. However, more important 
is the fact that we can generate curves 


was 


data. 


by H-I which closely approximate the 
observed results. By employing a 
Monte Carlo method (5) 10 individual 
learning curves were generated under 
the conditions of Equation 4, five 
the experimental parameter 
values of Group A, and five using 
those of Group B. The @ and p(0) 
values employed for Group A were 
063 and .480; respective values for 
Group B were .O10 and .550. The 
averages by blocks of 16 trials for 
each group of five curves are presented 
in Fig. 1. 


using 


As can be seen, there is 
close agreement between these values 
and the observed values. 

We conclude that, of the three 
hypotheses presented, only H-I can 
account for the findings of this study. 
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SUMMARY 


This study employs a modified Estes- 
Straughan “verbal conditioning” situation. 
Each trial consists in the presentation of a 
signal followed by an FE, or FE reinforcing light; 
S operates an appropriate key to indicate his 
prediction as to which light will appear on each 
trial. The present study deals with the effect 
of introducing trials on which neither reinforcing 
light is presented, that is, trials on which S is 
given no information as to which of the two 
alternative predictions is correct. 

Three hypotheses are developed, in terms of 
statistical learning theory, concerning the effect 
of such trials. Since some schedules of E,, Es, 
and nonreinforced trials provide identical pre- 
dictions from the three hypotheses, it was 
necessary to employ a schedule which yielded 
differential prediction. ‘The one selected was 


such that on any trial (a) if S predicted E, then 
E;, Es, or no reinforcement would occur with 
probability 6x, 8(1 — w) and 1 — 8, respectively, 
while (b) if S predicted F, either FE; or Ey would 
occur with probability and 1 — x. 

The set of quantitative predictions generated 


by each hypothesis was checked against experi- 
mental results. It was concluded that the 
hypothesis requiring zero change in response 
probability on trials on which no information is 
given best accounts for the findings. 


REFERENCES 


1. Esres, W. K., & Burke, C. J. A theory of 
stimulus variability in learning. Psy- 
chol. Rev., 1953, 60, 276-286. 

2. Estes, W. K., & Srraucnan, J. H. Analysis 
of a verbal conditioning situation in terms 
of statistical learning theory. J. exp. 
Psychol., 1954, 47, 225-234. 

3. Gurnraie, E. R. 
psychological 
1946, 43, 1-20. 

4. Jonpan, C. Calculus of finite differences. 
New York: Chelsea, 1950. 

5. McCracken, D. D. The Monte Carlo 
method. Scientific Amer., May 1955, 
H-96. 


Psychological facts and 
theory. Psychol. Bull., 


(Received June 30, 1955) 





Journal of Experimental Psychology 
Vol. 52, No. 1, 1956 
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MEMORY EXPERIMENTS! 
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The problem of subdividing con- 
nected verbal material for quantita- 
tive scoring of recall occurs in the 
construction of memory scales, in 
laboratory studies of rumor trans- 
mission, and in investigations of the 
effects of attitude, frame of reference, 
and other variables on memory. The 
manner in which the material is 
divided may have significant effects 
on experimental results, yet few re- 
searchers have paid much attention 
to it. Not infrequently the only 
reference to preparation for scoring in 
an experimental report is the notation 
that the material was divided into 
“items,” or “ideas.”” This technique 


is the one which is commonly used in 
breaking up prose for quantitative 


analysis. That is to say, the stimulus 
passage is divided into a number of 
“meaning units.” A meaning unit is 
a linked set of words, usually less than 
a sentence, which E regards as an 
ideational entity. The procedure was 
probably first applied by Henderson 
(3) over fifty years ago, and has since 
been used widely in investigations of 
memory. The following subtest of 
the Wechsler Memory Scale (12) will 
serve as an illustration. The units 
are separated by diagonals. 


' This paper is based in part on a section of a 
dissertation submitted in partial fulfillment of 
the degree of Doctor of Philosophy in the De- 
partment of Psychology, Columbia University, 
in 1952. The author is indebted to Drs. Otto 
Klineberg and Henry E. Garrett for their direc- 
tion of the dissertation. The experimental sec- 
tions were done while the author was at the 
Child Welfare Research Station, University of 
Iowa. 


Anna Thompson/ of South/ Boston/ em- 
ployed/ as a scrub woman/ in an office 
building/ ‘reported/ at the City Hall/ Sta- 
tion/ that she had been held up/ on State 
Street/ the night before/ and robbed/ of 
fifteen dollars/. She had four/ little chil- 
dren/ the rent/ was due/ and they had not 
eaten/ for two days/. The officers/ touched 
by the woman's story/ made up a purse/ 
for her/. 


The procedure which is now em- 
ployed is the classic method of re- 
tained members, in which it is 
assumed that the stimulus cannot be 
recalled totally by S, and his recall 
score will be the number of units 
which he can reproduce to the satis- 
faction of E. There are a number of 
different ways of determining ade- 
quate recall and of assigning partial 
credit, such as those of Henderson 
(3), Lyon (8), and Clark (2), among 
others. The present investigation, 
however, is concerned only with 
division of the stimulus. 

It is evident that any method for 
creating meaning units or idea groups 
will be more or less arbitrary. There 
have been no attempts to define idea 
groups empirically, that is, based on 
the performances of a group of Ss, and 
there is considerable doubt that such 
a task could be accomplished satis- 
factorily. The fact that the method 
will be entirely at E’s whim makes it 
all the more unfortunate that few 
seem concerned with the methodology. 
Suppose, for instance, that a given S 
recalls 6 idea groups of the 24 in 
Wechsler’s passage. His recall! score 
is then 25%. His performance in- 
cluded the phrase “of ten dollars,” 
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instead of the correct “of fifteen 
dollars.”” Hence he received no credit 
for this idea group. Now if we had 
divided the phrase originally into two 
idea groups, thus: / of fifteen/ 
dollars/, S would have been credited 
with one of two idea groups where he 
received zero credit before. His re- 
call score is now 7/25 = 28%. We 
have increased this S’s score by fully 
12% of his original score with one 
simple change of a meaning unit. 


The full significance of this alteration 
can be seen if we realize the multiplicity 
of ways in which a prose passage can be 
divided for scoring. The number of 
possible arrangements can be computed 
by means of the following formula if the 
maximum number of words per idea 
group is known? 


x(x — 1) 


Awzxn- 3 


in which n is the number of words in the 
passage and x is the maximum number of 
words per IG. For Wechsler’s passage 
n= 68 and x = 6. The number of 
possible arrangements is 393. This total 
is somewhat misleading since any prose 
contains a certain number of minor parts 
of speech like the preposition ‘‘of” which 
could logically be grouped with only one 
set of words. If we deal only with nouns, 
pronouns, verbs, and adjectives, for 
Wechsler’s passage, m = 45, x = 5, and 
A=215. Even if we cut down the 
maximum words per IG to 3 there will 
still be 132 possible arrangements. No 
matter what kind of allowances are made 
there will nearly always be a considerable 
number of arrangements even for a 
relatively short piece of prose. When 
the stimuli contain several hundred 
words as in the studies of Clark (2), and 
Levine and Murphy (5), the number of 
arrangements approaches or exceeds a 
thousand. 

A particular 


arrangement can be 


* The author wishes to express his gratitude to 
Dr. Robert Plutchik of Hofstra College for de- 
vising this formula. 


TABLE 1 


Means anv SD’s or Worps rer Ipza Group 
in Turee Strupies 





Study _ Mes : | 
| 


Words 


Taft (11) | 2.26 | 120 
Levine-Murphy 294 
(5) 


Clark (2) 





described in terms of the mean number 
of words per IG and the SD of the 
distribution of words per IG. These 
data are shown in Table 1 for six prose 
passages taken from three studies of 
functional memory (2, 5, 11). 

Inspection of Table 1 suggests that 
there are significant differences in vari- 
ance from passage to passage. This is 
confirmed by a Bartlett’s test which 
yields a x? of 52.08, P < .0O1 for 5 df. 
This does not preclude a variance analy- 
sis of the means if we assume that the 


- shapes of the distributions do not differ 


markedly. An F ratio among the means 
is 6.14, which is significant beyond the 
.001 level, and hence may be accepted as 
indicating significance at the .01 level 
(cf. 7). 

We may conclude that different Es 
have used varying kinds of unit groupings 
in experimental memory studies. There 
is a tendency for the mean number of 
words per IG to fluctuate directly with 
the number of words in the passage. 
These differences may account, at least 
in part, for conflicting experimental 
reports, especially reports of the per- 
centage of recall for a given sample. 
They may have confounded results in 
studies in which the ability of a homo- 
geneous group to recall material which 
was favorable (in the broadest sense of 
the word) to the group was contrasted 
with its ability to recall an unfavorable 
or neutral passage. Many of the in- 
vestigations of functional mnemonics, 
such as those of Alper and Korchin (1), 
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TABLE 2 


Numeer or Ipea Groups per Passace For 
Four Passaces, as Determined 
py 31 Fs 


Idea Groups 
Passage Words 


A 58 6.38 
B 46 : 5.11 
> |e 6.09 
> i 6.06 





Mean $6.25 10.48 





Clark (2), Levine and Murphy (5), 
Menzies (9), and Taft (11), fall into this 
category. For example, Clark found 
that his Ss recalled more of the affective 
passage (II) than they did of the neutral 
passage (III). However, the variances 
of these two passages differ significantly 
(F = 1.84, P = .05), with the affective 
passage being more variable. It is 
possible that greater variability in the 
number of words per IG_ enhances 
measured recall, thereby confounding 
Clark’s results. The same consideration 
applies to the Levine-Murphy study in 
which both the pro-Communist and 
anti-Communist groups recalled the pro- 
Communist passage better than the 
anti-Communist passage. The variance 
ratio is again significant (F = 2.27, 
P = .01), with the pro-Communist pas- 
sage having the greater variability. 


In the light of the foregoing dis- 
cussion, three experiments were under- 
taken. The purpose of the first was 
to determine whether a group of Es, 
guided by the same instructions, will 
divide passages in different ways. 
The purpose of the second was to 
investigate the effects of differences 
in mean words per IG on the mean 
recall score of a group of Ss, and on 
the relative position of Ss within the 
group. The aim of the third was to 
determine the effect of variability of 
the number of words per IG on the 


mean recall score and on relative 


position, 
Experiment | 


Method.—Two connected verbal passages of 
the Wechsler Memory Scale (12) and two pas- 
sages from the Stone-Girdner-Albrecht alternate 
form of the scale (10) were presented to 31 
faculty members and graduate students at the 
lowa Child Welfare Research Station, with the 
following instructions: “Imagine that you are 
conducting an experiment in which it is necessary 
to obtain quantitative scores representing the 
recall of these four passages by a group of sub- 
jects. The purpose of your experiment is im- 
material; all that matters is that you need an 
objective measure of the subjects’ ability to 
recall the stimuli. Divide each passage into 
‘units’ in some fashion. ‘The procedure you use 
is entirely at your own discretion. There are 
no limitations or conditions except those already 
explicitly stated.” 

Only two members of the group had any prior 
acquaintance with the memory scales, and both 
later stated that their familiarity was insufficient 
to influence their performances. 


Results —The number of IG’s per 
passage was counted for each E's 
division of each passage. The data 
are shown in Table 2. Passages A 
and B are from the alternate form, C 
(reproduced on p. 0), and D are from 
Wechsler’s original scale. 

The significant variation among Es 
in dividing the passages, as well as 
among the passages themselves, is 
demonstrated by the analysis of 
variance of Table 3. Not only are 
there differences among £s, but the 
same £ is likely to divide different 
passages so as to obtain different 
mean numbers of words per IG. 


TABLE 3 


Anacysis or Variance or [pea Groves or 
Four Passaces py 31 Ls 


| | 


Source | 


[ i 
yw |i ri P 
] 


Square 


ae 5 bs 


128.77 | 23.58 | OO 
38.10 a Ol 


er 
Passages 
EX 


5.46 | 
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Experiment II 


Method.—To determine the effects of varia- 
tions in mean number of words per IG on experi- 
menta! results, Passage C was administered to a 
group of 29 Ss as a rote memory test. Each 
protocol was scored using four different divisions 
of the stimulus passage : Wechsler’s own grouping 
(12) and three groupings selected from eighteen 
in Exp. 1? Grouping T had 11 IG’s, which is 
apparently the smallest number which will yield 
a useable distribution of scores. Grouping U 
had 15 IG’s, V had 24 1G’s (the largest number 
made by any E in Exp. I), and W, Wechsler’s 
division, also had 24. 

Since the passage has 67 words, the groupings 
yield respective mean numbers of words per IG 
of 6.09, 4.47, 2.79, and 2.79. The corresponding 
SD’s are 1.97, 1.33, 1.41, and 1.38. An F ratio 
among means is 16.62, P < .O1, while a Bart- 
lett’s test of the variances results in a x? of 3.29, 
P > 30. Thus, the means differ significantly, 
but the variances are homogeneous. 


Results.-The mean number of IG’s 
recalled as a percentage of the number 
of 1G’s in the passage, and the SD of 
the distribution for the 29 Ss for each 
of the four scorings is shown in the left 
half of Table 4. 

After converting the percentages in 
arcsins, an analysis of the variance 
yields an F of 219.27 for the means, 
and 18.32 for Ss. Both are significant 
beyond the .0O1 level. In general, 
the mean recall score tends to be 
inversely proportional to the mean 
number of words per IG, and directly 
proportional to the number of IG’s. 
It is concluded that variations in the 
mean number of words per IG, apart 
from differences in variances, can 
result in significant differences in the 
recall score of a group of Ss. 

The rank-order correlations among 
ranked recall scores for the four 


* Thirteen of the 31 divisions had less than 11 
IG’s. ‘These divisions failed to yield useable 
distributions of scores. The highest obtained 
score seldom exceeded 3 IG’s, and 50% or more 
of the scores were zero. It appears that when 
the division system has an average of 7 or more 
words per IG, the task of accurate recall of com- 
plete IG’s becomes too difficult, and distribu- 
tions of scores are flattened beyond utility. 


TABLE 4 


Mean Recat Scores 1n PercentTaGes For 
Four Divisions or Passace C, ann Ranx 
INTERCORRELATIONS AmMonc THEM 


Correlations 
Division 








22.73 
31.53 | 
41.13 
41.96 | 


16.45 
15.73 
18.21 | 
17.29 | 








divisions are given in the right half 
of Table 4. The average rho is. 887. 
It is evaluated in the null case by 
means of a _ transformation‘ into 
Kendall’s coefficient of concordance, 
W (4), which is in turn evaluated by 
means of x*,. In this case, W = .91, 
and x?, = 101.92, which is significant 
beyond the .0O1 level for 28 df. 

Evidently, differences in mean num- 
ber of words per IG have very little 
effect on the relative positions of Ss 
within the group, despite the differ- 
ences in mean recall scores. 


ExperiMeEnNtT III 


Method.—To investigate the effects of varia- 
bility of the number of words per IG, four divi- 
sions of Passage C having identical means but 
heterogeneous variances were devised. These 
scorings, which are designed as X, Y, and Z, 
together with W, Wechsler’s division, all con- 
tained 24 IG’s, and hence each had a mean 
number of words per IG of 2.79. The respective 
variances were 4.61, 3.22, .61, and 1.91. A x? 
obtained from a Bartlett's test was 16.61, sig- 
nificant beyond the .0O1 level. Thus the vari- 
ances differ significantly, though the means are 
the same. 

Passage C was again administered to a group 
of 25 Ss as a rote memory test. 


Results.—Since the number of IG’s 
is identical for each scoring, the mean 


4 W is a linear function of the average rho, and 
pm—p+l 
m 
in which m is the number of rankings being aver- 
aged. It can also be computed directly from the 

rankings (cf. 4). 


can be obtained by the formula W = 
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TABLE 5§ 
Mean Recact Scores in I1G’s ror Four 


Divisions or Passace C, anp Rank 
Correrations Amonc Tuem 


IG's Correlations 
Division -- 


| Mean 


W | 10.92! 2.99 
X | 9.96} 3.28 
y | 10.60) 3.05 
Z 988 | 2.92 


number of IG’s recalled by the Ss, 
rather than a percentage, and the 
respective SD’s are listed in the left 
half of Table 5. 

An analysis of variance yields a 
significant F of 4.93, P < .005, among 
means. The variance due to Ss is 
also significant, as expected (F 
= 27.40, P < .0O1). Differences in 
the variability of the number of words 
per IG can thus also produce signifi- 
cant differences in mean recall scores, 
although there is no clear relationship 
between size of the variance and the 
recall score. 

The rank intercorrelations among 
the scorings are shown in the right 
half of Table 5. The coefficients are 
uniformly high, similar to those in 
Table 4. The average rho is .857 and 
W is .89, which is again highly sig- 
nificant. The conclusion is the same 
as that for Exp. II: differences in 
variability of the number of words per 
IG affects the relative positions of Ss 
only slightly, in spite of the differences 
in mean recall scores, 


Discussion 


Three primary conclusions can be 
drawn from the data presented here: 
(a) different Es are likely to prepare 
connected verbal stimuli for quantitative 
experiments in different ways, and in 
addition, an individual E may himself 
divide passages differentially; (4) differ- 


ences in the mean number of words per 
IG can result in significantly different 
mean recall scores for a sample of Ss; 
and (c) differences in variability of the 
number of words per IG can also produce 
different mean recall scores. As a 
secondary conclusion, we can say that 
neither type of difference in division is 
apt to affect relative positions of Ss in 
the group. 

The implications of the experiments 
reported here were suggested in the 
introduction. In a study in which two 
or more passages are used as stimuli, or 
when experimental results are to be 
compared with those of an earlier work, 
the method of division of the stimuli may 
be a pertinent factor. Differences in 
division are likely to cause experimental 
findings to fluctuate, and will thus con- 
found the effects of the experimental 
variable. As previously noted, many 
of the studies of mnemonic dynamics 
may have been subject to this type of 
confounding. 

The results of Exp. II indicate a 
tendency for the mean recall score to 
vary inversely with the number of words 
per IG. Analysis of the data of Table 1 
had suggested that greater variability of 
the number of words per IG makes for a 
higher recall score, but this conclusion 
was not substantiated by the results of 
Exp. III. However, differences in vari- 
ability were uncomplicated by differences 
in mean number of words per IG in 
Exp. III. This was not true of the 
Levine-Murphy (5) and Clark (2) 
studies. This raises the possibility of a 
cumulative effect of differences in both 
means and variability; it is possible that 
effects 
fashion. 

The “methods” for preparing con- 
nected verbal stimuli for quantification 
which have reported in earlier 
studies usually do not actually stipulate 
rules for making the division into units. 
The experiments reported here point up 
the potential value of some real method 
for dividing stimuli so that the mean and 
variance of the number of words per IG 
do not vary from passage to passage. A 


are additive 


in some unknown 


been 
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simple technique which appears to be 
suitable has been devised (6). It con- 
sists of dividing the passage into its 
grammatical entities. The subject of a 
sentence, the predicate, and prepositional 
and other clauses, each constitute an IG. 
The following sentence is divided into 
“grammatical idea groups”: He/ went/ 
for days/ with very little sleep/ and is/ 
so tired/ that he/ can barely move/ a 
muscle/. 

This method usually results in a mean 
of about 2.2 words per IG with an SD 
of about 1.00 for ordinary prose. A fine 
unit is probably the more efficient meas- 
ure, but should it be deemed necessary 
the mean and variability can be increased 
somewhat by combining fo be verbs with 
the predicate, and by adding pronouns 
to the subsequent verb, thus: He went/ 
for days/ with very little sleep/ and is so 
tired/ that he can barely move/ a 
muscle/, 

If the stimulus passages are similar in 
reading ease as measured by any con- 
ventional formula, the use of the gram- 
matical IG should result in divisions 
which do not differ significantly. Even 
numerical differences will usually be 
quite small. 


SUMMARY 


It was determined by an analysis of previously 
published data, and by an experiment reported 
here, that different Zs are likely to prepare con- 
nected verbal material for quantitative scoring 
in different ways. Experiments were performed 
to assess the effects of differences in division on 
experimental data, The results indicate that 
differences in either mean number of words per 
idea group, or in the variability of the number 
of words per idea group, can affect experimental 
findings significantly, and may obscure measure- 
ment of the experimental variable. Relative 
positions of Ss in a group are not affected by 


either type of difference. The use of the gram- 
matical idea group was recommended as a 
method of preparing stimuli which will eliminate 
confounding differences in division. 
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EFFECT OF OVERLEARNING OF A VERBAL RESPONSE 
ON TRANSFER OF TRAINING! 


GEORGE MANDLER AND SHIRLEY H. HEINEMANN 


Harvard University 


A previous study by Mandler (5) 
has described transfer effects as a 
function of prior overlearning of 
motor responses. The present ex- 
periment represents a replication of 
that study in a different, i.e., verbal, 
effector modality. In the preceding 
study Ss were given training on a task 
which required the assembly of dis- 
crete motor units into a complex 
response. Degree of overlearning, 


i.e., practice on the training task 
beyond the stable establishment of 
S-R connections or habits, was related 
to subsequent transfer performance. 
The results indicated increasing posi- 
tive transfer when a practiced re- 
sponse had to be associated to a new 


stimulus, initial negative transfer 
when a new response had to be learned 
to a previously used stimulus, and 
positive transfer at high degrees of 
overlearning when both stimulus and 
response had been used in the training 
task but not previously paired. 
While the first and second findings 
were consistent with previous work 
(9, 11), the last result conflicts with a 
theoretical analysis by Gagné, Baker, 
and Foster (4) and a confirmation of 
their prediction, with verbal material, 
by Porter and Duncan (8). These 
investigators predicted and found 
maximum negative transfer in the 
S-R reversal condition, i.e., when 
previously used stimuli and responses 
were paired in new combinations. 
The present experiment is con- 
cerned with the generality of the 


! This study was supported by a grant from 
the Laboratory of Social Relations, Harvard 
University. 


previous findings for motor behavior. 
Here we have employed a task in 
which verbal units are assembled into 
complex overlearned responses. The 
study of three transfer situations is 
patterned after Bruce’s paradigm (2). 
He also used verbal material, but 
restricted himself to lower degrees of 
training. This study should also 
provide a further test of the theo- 
retical analysis of response factors in 
human learning (6). 


Metruop 


Task and apparatus.—The responses used 
were three-place consonant nonsense syllables 
selected from Witmer’s standardization study 
(13). The stimuli were single integer numbers. 
The material was presented on a Hull-type 
memory drum. All Ss, except one control group, 
were trained on an original task (Task I) and 
tested on a transfer task (Task II). 

In both tasks the stimulus item for each pair 
was presented for 2.5 sec., followed by a 1.5-sec. 
exposure of stimulus and response together. 
Then the next stimulus appeared with the re- 
sponse window closed, and so forth. The S was 
required to pronounce the three letters, and a 
response was scored correct if it was elicited by 
its stimulus at any time prior to the exposure of 
the response on the memory drum. A complete 
record of all responses was kept by £. 

Subjects. —The Ss were & paid undergraduate 
students obtained through the student employ 
ment office. They were assigned at random to 
10 groups (five degrees of training and two 
forms) as they appeared for the experiment. 

Instructions.—All Sse were given standard 
paired-associate instructions prior to Task I, 
They were also told that in the course of the 
experiment E might stop the procedure in order 
to make some adjustments to the apparatus. 

Transfer conditions.—The Ss learned a four- 
pair training task (Task I) to the various degrees 
of overlearning and then were switched to an 
eight-pair transfer task (Task I), which simul- 
taneously tested transfer effects for four con- 
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TABLE 1 


Stimutt anv Resronses Usep in THe Two Forms or Tasks I anv II 





Tranefer Conditions 
Represented in 
Task Il 


| LFH (50)* 
| KGR (71) | 
| BOX (58) | 
| MJS (54) | 
Il. (New S—Old R) 


| 
| 


. (Old S—New R) 


(Old $—Old R) 


IV. (New S—New R) 





| | 


* Figures in parentheses show the association values obtained from Witmer 
all the syllables used in each condition are: Cond. 


values 
$8.25. 


ditions of transfer. ‘The stimuli and responses 
used in Task I will be called “old” stimuli and 
responses; those first introduced in Task II will 
be called “new.” This results in four transfer 
conditions: 


I. Learning to make a new response to an old 
stimulus. 

Il. Learning to make an old response to a new 
stimulus. 

III. Learning to make an old response to an 
old stimulus when these have not been previously 
paired. This involves the recombination of 
stimuli and responses used in Task I. 

IV. Learning to make a new response to a 
new stimulus. This condition was used as a 
control for such phenomena as the warm-up 
effect demonstrated by Thune (10). 


‘Two forms (A and B) of Tasks I and II were 
used. Table 1 shows the stimuli and responses 
used in both forms of the two tasks. Twelve 
different letters were selected in such a way as to 
provide maximum phonemic differentiation. 
The responses were obtained from Witmer’s 
lists (13). They fell within as narrow a middle 
range of association values as was feasible 
within the other requirements of the design. 
The mean association values (shown in Table 1) 
of the responses used in the various conditions 
did not differ significantly from each other. The 
letter combinations also fulfilled the following 
conditions: (a) Any subsequence of two letters 
that was used in a given response was not used 
in any other response. (6) In Task I, no letter 
was used more than once; in Task II, no letter 


/XSK 


| GBL 


Form B 
Task I! 


K 


| MJS 
| BOX 


2 | GBL (54) | 2 
| FRJ (58)) 5 
XSK 
HMQ (54) | 6 


(63) | 
HMQ (54) | ; 


LFH 


(54) 
(58) 


(50)| 8 


BOX (58)| 6 


MJS (54) | 
KGR 7) | 


(63)'| & | GBL 


| XSK 
| | FRY (58) 
| 
| 


(54) 
(63) 
| HMQ (54) 


9 | LFH (50) 
3 | KGR (71) 


(54) 
FRI (58) | 


(13). The mean association 
I: 57.75, Cond, 11: 58.75, Cond, Il: 56.75, Cond, IV: 


was used more than twice.? (c) No letter 
appeared more than once as either the first, 
second, or third member of a set. (d) In those 
transfer conditions (I and III) where two 
interfering responses are elicited by the same 
stimulus, no letter is common to both responses. 

A “trial” in Task I consisted of a repetition 
of the four S-R pairs. ‘Twelve different order- 
ings of pairs within trials were used, the only 
restriction on the ordering being that the first 
pair of a succeeding trial be different from the 
last pair of the preceding trial. This gave a 
total sequence of 48 pairs (12 trials) which was 
successively repeated. A comparable method 
was used with the transfer task except that with 
eight pairs per trial, six trials completed a run- 
through. 

Degree of training conditions.—The degrees of 
training used in this experiment were the same 
as those used in the motor study. They con- 
sisted of 0, 10, 30, 50, and 100 errorless trials on 
Task I. A different group of Ss was used for 
each of these conditions; these five groups will 
hereafter be designated in terms of the amount 
of training received. 

Group 0 had no training on Task I. In 
Group 10, S was stopped on Task I as soon as 


2A preliminary investigation showed that it 
would not be possible to parallel the motor study 
more exactly by using combinations of only six 
different letters. None of five pilot Ss was able 
to learn a list of eight stimulus response pairs 
where the eight different responses were made 
up of only six different letters. 
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his performance included 10 errorless trials. In 
Group 50, there was a rest for 1.5 min. after 
25 errorless trials, and then S continued until 
he reached the criterion of 50 errorless trials. In 
Group 100, S was treated the same as were 
Group 50 Ss, except that when he finished 50 
errorless trials he was told to come back the 
next day. On the second day, he was given two 
additional blocks of 25 trials each on Task I. At 
the conclusion of the requisite number of error- 
less trials, all Ss were given a 3-min. rest, while 
E changed the tape on the memory drum. They 
then worked on Task II to a criterion of two 
successive errorless trials (16 correct responses) 
or 20 trials, whichever occurred last. Group 0 
proceeded, after identical initial instructions, 
directly to Task II. 


RESULTS 


Training results.—An analysis of 
variance of the training data of all Ss 
who were given Task I, using number 
of trials to the first correct trial, 
showed no significant differences be- 
tween groups. There was a signifi- 
cant difference between Forms A and 
B. Form A had a mean of 17.5 
(SD = 12.5) and Form B a mean of 
10.5 (SD = 6.6) trials to the first 
correct trial. 

Figure 1 shows the mean number of 
errors in successive blocks of trials 
containing 10 correct trials for both 
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Fic. 1. Mean errors per subject to perform 
successive blocks of ten perfect trials, shown as a 
function of level of training (number of correct 
trials achieved). 


forms, which were combined since no 
significant differences were found be- 
tween forms from 20 correct trials on. 
In Fig. 1 all four training groups were 
combined for the first point; Groups 
30, 50, and 100 are combined for the 
second and third point, and soon. It 
can be seen that the number of errors 
reaches asymptotic values by 40 
correct trials. It should be pointed 
out that the mean error values are 
extremely low. For the 50 correct 
trial point the average S made .33 
errors, while giving 40 correct re- 
sponses (four pairs each of 10 correct 
trials). From that point on the error 
percentage is less than 1%, with the 
exception of the block of 10 correct 
trials immediately following the 24-hr. 
rest. Thus, at least for Groups 50 
and 100, overlearning, defined as 
training beyond the establishment of 
stable S-R connections, can be pre- 
sumed to have been established. 
Transfer results.—Two different 
transfer measures, identical to the 
ones used in the previous study, were 
obtained for each S for each of the 
three transfer conditions. Condition 


IV was used as a control. One meas- 


ure, the F-20 score, used the frequency 
of correct responses during the first 
20 trials on Task II. 
quency on the control condition was 


The raw fre- 
subtracted from the frequency on 
each of the relevant transfer con- 
ditions. ‘The other measure, the FCR 
score, was the logarithm of the ratio 
of transfer to control performance in 
terms of the number of the trial on 
which the first correct response oc- 
curred. These measures use Ss as 
their own controls and utilize control 
conditions that are specific to the 
particular form used. 

Analyses of variance of the transfer 
measures showed no significant differ- 
ence associated with the interaction 
between Forms and Groups, or be- 
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Fic. 2. Transfer of training as a function of degree of original training in terms of frequency of 


correct responses in 20 transfer trials (F-20). 
those of the motor study. 


tween Forms, Groups, and Con- 
ditions; the data were therefore 
combined. ‘The analyses did show a 
significant effect due to Conditions 
(P < 001), as well as significant 
interactions between Groups and Con- 


ditions (P < .001). The P values 


presented below refer to the signifi- 
cance of t tests based on mean scores. 
Figures 2A and 3 show the results 


Condition 
1 (OLO S~ NEW A) — 
B (WEW S~- OLD 8) 
@ (OLO $-OLO A) e—wme 


a 
af 
poune 








A, 
r 


TRANSFER ON FIRST CORRECT 


CORRECT TRIALS IN TRAINING 


Fic. 3. Transfer of training as a function 
of degree of original training in terms of at- 
tainment of the first correct response (FCR). 
All transfer effects are positive; the negative 
values are a function of the measure used: 
10 log Transfer score 


Control score 


Fig. 2A shows results of the present study, Fig. 2B 


for the two transfer measures, F-20 
and FCR. Figure 2B shows the 
results of the motor study for com- 
parison purposes.’ In Fig. 2A and 
2B the results for the control group 
(Cond. IV) have also been included, 
but note that the curves for the other 
conditions represent mean differences 
between the particular score for that 
condition and the score on Condition 
IV. As in the motor study, Group 0, 
without previous training, showed 
some differences between the four 
conditions due to differences between 
the S-R pairs used. In order to take 
account of these initial differences 
between the S-R pairs, the data were 
corrected to assume a common origin 
(zero transfer). Comparisons within 
any one transfer condition are un- 
affected by this correction. Statis- 
tical comparisons between transfer 
conditions among training groups 
were controlled for initial difficulty 


* Figure 2B differs slightly from the figure in 
the motor study due to an error in the original 
graph. Cond. I, Group 100, shows some nega- 
tive transfer as in Fig. 2B, rather than zero 
transfer. ‘This drafting error does not affect any 
of the results or discussion in this or the previous 
study. 
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by testing the significance of differ- 
ences in all such comparisons against 
the differences obtained in the cor- 
responding comparisons with Group 0. 

Before discussing the specific trans- 
fer results, one finding should be 
pointed out. In both Fig. 2A and 3 
the curves for all three transfer con- 
ditions show a parallel drop from 
Group 50 to Group 100. Since all 
the transfer data have been corrected 
for Cond. IV, it is obvious from Fig. 
2A that the drop for the three transfer 
conditions can be accounted for com- 
pletely by the rise on the Cond. IV 
(New S—New R) curve. This re- 
lationship also holds for the FCR 
measure. Thus, while no change took 
place for the three transfer conditions 
with an increase of 50 correct trials, 
the facilitative effect of these addi- 
tional trials for the control condition 
makes for an apparent drop in the 
other conditions. No such simple 


explanation holds for any of the other 


findings, where the three transfer 
conditions do not show identical 
changes as a function of the control 
condition; nor is this explanation 
applicable to the motor response 
findings as shown in Fig. 2B. In the 
description of the differential transfer 
results we will therefore not be con- 
cerned with this change from Group 
50 to Group 100, the magnitude and 
direction of which can be directly 
accounted for by the control effects. 

Learning a new response to an old 
stimulus (Cond. I) shows different 
effects for the two measures. For 
F-20 there is an initial negative trans- 
fer effect followed by a return to 
about zero transfer. However, none 
of the differences is significant. For 
the FCR measure we find no negative 
transfer effect at all, but rather a 
significant positive transfer effect from 
0 to 50 (P < .O1) and from 30 to 50 
(P < .02). 


In Cond. II, learning an old re- 
sponse to a new stimulus, we find 
increasing positive transfer as a func- 
tion of increasing training. This 
effect, however, does not reach sta- 
tistically significant proportions until 
the 50 trial point (P < .01 for both 
measures). In this condition 21% of 
the responses in Group 100 were 
correctly given by the second trial, 
i.e., showed one trial learning. 

Learning to make an old response 
to an old stimulus when these have 
not been previously paired (Cond. 
III) shows the highest positive trans- 
fer effects. Positive transfer is al- 
ready significant for Group 10 (P 
< .05 for F-20, and P < .O1 for 
FCR), and remains at that high level 
for the other groups. ‘This condition 
also shows the highest degree of one 
trial learning, i.e., 29%, of 
sponses in Group 100. 

In comparing conditions within 
groups we find that on the F-20 
measure (Fig. 2A) all three conditions 
differ significantly from each other in 
Group 10 at the .02 level or better. 
This difference is maintained until 
Group 50 where Cond. II and III are 
significantly better than Cond. I 
(P < .0O1), but do not differ from 
each other. On the FCR measure 
(Fig. 3) Cond. III is consistently 
superior to the other two conditions 
(P < .O5 or better); Cond. I and II 
differ significantly from Group 30 on. 

In comparing these results with 
those of the motor study, Fig. 2A and 
2B reflect the major differences. Of 
primary interest is the superiority of 
Cond. III (Old S—-Old R) to the other 
conditions in the present study. In 
the motor study Cond. III did not 
show any significant positive transfer 
until Group 100, and for the F-20 
measure it was consistently inferior to 
Cond. II (Fig. 2B). The present 
experiment shows that Cond. III not 


the re- 
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only shows positive transfer from 
Group 10 on, but also is superior to 
Cond. II in Groups 10 and 30, and 
essentially equal to it in Groups 50 
and 100. Apart from this striking 
change in Cond. III there is some 
indication of a difference in Cond. I. 
In the present results we found sub- 
stantially a lack of any significant 
transfer effect. In the motor study 
the F-20 measure also showed no 
significant effects, but for the FCR 
measure negative transfer approached 
statistical significance for Group 30. 
The present data show no negative 
transfer at all for this measure (Fig. 
3). 
The different results of the two 
studies can be summarized as a general 
trend toward positive transfer or 
increase in positive transfer for all 
three conditions in this study, and a 
strikingly superior positive transfer 
effect for Cond. III. 


Warm-up results.—The warm-up 


effect refers to a general improvement 


in performance as a function of in- 
creasing experience in the experi- 
mental situation. Condition IV 
(New S-—-New R) is a direct measure 
of this effect and shows a consistent 
increase as indicated in Fig. 2A. A 
more detailed discussion of the warm- 
up effects in the motor study (7) 
shows that the Group 100 perform- 
ance was inferior to the Group 50 
performance. In the present study 
we find a reverse of this effect, i.e., an 
increase from Group 50 to Group 100, 
While this increase is not significant, 
the difference does account for the 
depressing effect on the three transfer 
conditions. 


Discussion 


Our results show gross similarity to 
the previous findings with motor re- 
sponses (5). The increasing positive 
transfer for Cond. II and III was pre- 


dicted on the basis of response inte- 
gration during training. During training 
parts of complex responses serve as 
stimuli for each other and increasing 
correct repetitions during overlearning 
of these responses increase the inte- 
gration and cohesiveness of the re- 
sponses; the § learns how to perform the 
response. When these responses are 
then required in a new situation they are 
more easily elicited and maintained; § 
need not learn the response complex 
again. The lack of negative transfer in 
Cond. I (Old S—New R) has been 
ascribed to an intervening process. We 
have postulated that overlearning pro- 
duces symbolic analogues of the overt 
response. The symbolic responses are 
most stable when the response is highly 
overlearned, and they make implicit 
trial-and-error behavior possible. This 
intervening process facilitates the inhi- 
bition and elimination of an incorrect 
response before it becomes overt. 

The results of Cond. I can be compared 
with studies by Underwood (11) and 
Atwater (1). In both of these studies, 
which used the A-B, A-C paradigm, the 
results showed increasing facilitation as 
a function of degree of priorlearning. In 
neither of our studies have we found any 
consistent positive transfer for this con- 
dition. Two differences should be 
pointed out. Both Underwood and 
Atwater used meaningful, previously 
practiced responses, namely adjectives 
and three-letter words, respectively. 
Neither of the studies controlled for 
warm-up effects which, as we have 
shown, consistently produce positive 
transfer. A control for this latter factor 
alone would probably significantly de- 
crease their positive transfer findings. 

Condition ITI is consistent with Wylie’s 
(12) and Bruce’s (2) data, and with the 
generalization that learning an _ old 
response to a new stimulus produces 
positive transfer. 

In the discussion of the results of the 
motor study we pointed out (5, p. 416) 
that our findings for Cond. III were 
inconsistent with Gagné, Foster, and 
Baker’s (4) analysis of the “‘stimulus- 
response reversal” condition, and Porter 
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and Duncan’s (8) experimental evidence. 
These investigators had found maximum 
negative transfer for this condition. Our 
present data are even more in contra- 
diction to that finding in that we found 
maximum positive transfer for this con- 
dition. In contrast with our study, 
Porter and Duncan used adjectives as 
stimuli and responses, pretrained their 
subjects to only one correct trial, and 
tested only one transfer condition within 
any one list. However, with familiar 
stimuli, i.e., adjectives, where response 
integration is already established, we 
would have predicted relatively little 
negative transfer. On the other hand, 
little or no response learning takes place 
when such familiar highly practiced 
responses are used, and other, interfering, 
processes may more easily come to the 
fore. It might also be possible that 
transfer effects associated with very high 
degrees of training are distinctively 
different from those with 
relatively lower degrees of learning. 
Finally, in Porter and Duncan’s study 
all S-R pairs in any one list tested either 
the old stimulus-old response or the old 
stimulus-new response variation. Inthe 
present study only two out of eight pairs 
represented each condition. Thus, for 
the old stimulus-old response condition 
the Ss’ previous experience with both 
the stimulus and the response elements 
might have facilitated recognition of this 
relationship, made discrimination § of 
these two pairs easier, and accelerated 
their acquisition. Such an explanation 
of the divergence of the two findings 
would require an additional assumption, 
namely, that the recognition of an old 
stimulus-old response 


associated 


relationship is 


more advantageous for acquisition in a 
heterogeneous list, such as ours, than in 
a homogeneous list, such as Porter and 
Duncan’s. 

One final difference between the two 


overlearning studies should be noted. 
In the first place, transfer effects are more 
pronounced with verbal responses; sec- 
ondly, additional training beyond 50 
correct trials produced no differential 
transfer effect other than that accounted 


for by the increased warm-up facilitation. 


This finding is consistent with the 
generally accepted notion that verbal 
responses are the most highly overlearned 
human responses. Human Ss, particu- 
larly college students, have high degrees 
of experience in the novel recombination 
of letters and should show greater and 
more rapid transfer effects than with the 
assembly of motor responses. In addi- 
tion, fewer trials are apparently neces- 
sary to bring such novel verbal combi- 
nations to an asymptotic level of 
integration. Our finding of significant 
positive transfer effects associated with 
high degrees of overlearning is consistent 
with a recent analysis of overlearning 
and ability by Ferguson (3). Since 
adult human behavior is primarily 
constituted of overlearned responses, 
and associated with symbolic activity, a 
further investigation of these processes 
should shed light on individual 
differences in the performance of familiar 
tasks and responses. 


more 


SUMMARY 


Sixty Ss learned three-place consonant com- 
binations in response to single number integers 
in both a training and transfer task. The 
degrees of training were 0, 10, 30, 50, and 100 
errorless trials on the training task, with 12 Ss 
The task 
simultaneously tested four different conditions, 
three transfer and a fourth 
controlled for warm-up. 


in each of these groups. transfer 


condition which 
The results were 

1. Learning to make a new reponse to an old 
stimulus showed no 
positive transfer effects. 


consistent negative or 

2. Learning to make an old response to a new 
stimulus showed increasing positive transfer as 
degree of original training was increased. 

3. Learning to make an old response to an 
old stimulus, when these had not been previously 
paired, also showed increasing positive transfer 
as original training was increased, but a sig 
nificantly higher degree of positive transfer than 
the New S—Old R condition 


These data were discussed in relation to a 
previous comparable study which used motor 


responses. 
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SIMULTANEOUS AND SUCCESSIVE DISCRIMINATION 
IN A SINGLE-UNIT HOLLOW-SQUARE MAZE 


ALLEN D. CALVIN! AND CLARENCE M. WILLIAMS 


Michigan State University 


In a recent experiment Weise and 
Bitterman (6) compared the relative 
difficulty of simultaneous and succes- 
sive discrimination using a multiple 
discrimination apparatus consisting 
of four hollow-square units. They 
reported that the successive discrimi- 
nation was easier than the simul- 
taneous. Some experimental work 
with children by Calvin (2) indicated 
the possibility that the anomalous 
results obtained by Weise and Bit- 
terman were due to the fact that the 
stimuli were presented in such close 
spatial contiguity that they appeared 
as a single unified percept rather than 
as discrete units. Bitterman and 


Wodinsky (1) have presented a similar 
explanation of the Weise-Bitterman 


findings. However, when this hy- 
pothesis was tested by Calvin and 
Seibel (3), it was not confirmed. The 
simultaneous discrimination remained 
relatively more difficult than the suc- 
cessive even when the stimuli were 
spatially discrete. Calvin and Seibel 
further reported that various factors 
such as goal gradient, alternation, etc. 
were operating differentially in the 
two types of discrimination. The 
present experiment was designed to 
investigate further the relative dif- 
ficulty of simultaneous and successive 
discrimination using only one unit of 
the Calvin-Seibel modified multiple 
discrimination apparatus. 


Metuop 


A pparatus.—Our apparatus duplicated Calvin 
and Seibel’s (3) with the exception that only 


1 Now at Hollins College. 


one unit of the four-unit hollow-square maze was 
used. As in their experiment, two lamps served 
as discriminanda. Each was mounted 8&8 in. 
from the choice point. The starting box was 
8 by 8 in., the food box 8 by 17 in., and the 
alleys were 12 in. long and 4} in. high. The 
apparatus was painted a flat black and the top 
was covered with hardware cloth. The experi- 
mental room was dimly illuminated by an infra- 
red bulb. 

Subjects.—The Ss were 20 naive aibino female 
rats approximately 3 mo. old from the colony 
maintained by the psychology department at 
Michigan State University. Two Ss were 
discarded on the first day because of an error by 
E, atid two spare Ss which had also been handled 
along with the main group were substituted. 
During the preliminary gentling it was dis- 
covered that a male S had inadvertently been 
placed in one of the experimental cages. Al- 
though he was removed immediately, it later 
became apparent that two of the Ss had been 
impregnated. They were discarded and two 
more Ss were substituted for the impregnated Ss. 

Procedure.—The Ss were randomly divided 
into two groups. Group I learned the simul- 
taneous problem and Group II the successive. 
For the simultaneous group either the left or 
right bulb at the choice point was lighted. Half 
the Ss in this group had light positive and half 
dark positive. ‘The position of the positive cue 
was varied according to a predetermined random 
order with the restriction that the same side was 
never correct more than twice in succession and 
each side was correct for half the trials on each 
day. The Ss were given eight trials a day and 
the criterion was two errorless days in succession. 

For the successive group the bulbs at the 
choice point were either both lighted or both 
dark. Half the Ss had to go left to light and 
right to dark and the other half had to go left to 
dark and right to light. The same random 
orders that were used for the simultaneous group 
were employed, and again the criterion was two 
errorless days in succession. 

During the first few days of training some Ss 
occasionally refused to move, and when this 
occurred E prodded them. In no case was 
prodding necessary beyond Day 3. As in the 
Calvin-Seibel and Weise-Bitterman procedure, 
retracing was allowed. For a more detailed 
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description of the procedure see Calvin and 


Seibel (3). 
RESULTS 


Since retracing was allowed, S 
could make more than one error on a 
single trial. Figure 1 shows the 
course of learning in terms of total 
errors. (The curves for initial errors 
were, as would be expected, quite 
similar.) As may be seen, there is no 
marked difference between the two 
groups. When they are compared 
by means-of the Mann-Whitney U 
test (4), the difference between them 
falls far short of significance for both 
total and initial errors. 

In Fig. 2 the results for Group I, the 
simultaneous-group, are broken down 
in terms of those Ss who had light 
positive and those Ss with dark 
positive and these two groups are 
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TABLE 1 


Mean Error Scores 








Total 
Errors 


Initial 
Errors 


Group | 


Simultaneous light 
positive 

Simultaneous dark 
positive 

Successive 


compared with the performance of the 
Ss in Group II in terms of total errors. 
(A breakdown in terms of initial 
errors gave similar curves.) ‘Table 1 
presents mean error scores for the 
successive group and the two simul- 
taneous subgroups. It appears quite 
obvious that the dark-positive simul- 
taneous Ss are markedly superior to 
the successive Ss while the light- 
positive simultaneous Ss are con- 
siderably inferior in performance when 
compared with the successive. The 
following tests of significance were 
made with the Mann-Whitney U test 
(one-tailed): the difference between 
the dark-positive simultaneous and 
the light-positive simultaneous is sig- 
nificant beyond the 1% level for both 
total and initial errors; the difference 
between the dark-positive simultane- 
ous and successive is also significant 
beyond the 1% level for both total 
and initial errors; and the difference 
between the light-positive simultane- 
ous and the successive is significant 
beyond the 5% level for both total 


and initial errors. 


Discussion 


The fact that there was no significant 
difference in performance between 
Groups I and II is in direct opposition 
to the findings of Calvin and Seibel (3) 
and Weise and Bitterman (6), who 
found the successive discrimination to 
be relatively easier than the simul- 
taneous. We used only a single unit 
while Weise and Bitterman and Calvin 





SIMULTANEOUS AND SUCCESSIVE DISCRIMINATION 49 


and Seibel used a multiple-discrimination 
apparatus. The difference between our 
results and theirs supports Spence’s 
contention (5), and the Calvin and 
Seibel findings, that using the multiple- 
discrimination apparatus makes an ade- 
quate analysis of simultaneous vs. suc- 
cessive discrimination extremely difficult. 
However, even with the confounding 
effects of the multiple-discrimination 
apparatus removed, the simultaneous Ss 
were still not superior to the successive 
as would be expected from the approach- 
avoidance position. 

A solution to the problem of the failure 
of Ss of Group II to surpass those of 
Group I is suggested by an experiment 
conducted by Davis.2 Using a delay 
situation in a multiple-discrimination 
apparatus with slightly longer arms, 
Davis found that his dark-positive 
simultaneous Ss’ performance was su- 
perior to that of his successives and his 
light-positive Ss were inferior (although 
not significantly so) to his successives. 
A similar analysis for our data sub- 
stantiated Davis's findings. Weise and 
Bitterman also found that the simul- 
taneous dark-positive Ss were superior 
to the light-positive although the differ- 
ence failed to reach significance. How- 
ever, the performance of their 
simultaneous dark-positive Ss taken by 
itself was significantly inferior to that 
of their successive Ss. An analysis of 
Calvin and Seibel’s data also reveals a 
tendency for the simultaneous dark- 
positive to be superior to the light- 
positive, but as in the case of Weise and 
Bitterman the difference was not sig- 
nificant and the dark- 
positive Ss were significantly inferior 
Thus 
the negative phototropic effect seems to 
be less important in the standard 
multiple-discrimination apparatus prob- 
lem than in our single unit or Davis's 
delay condition. 

If the phototropic effects were exactly 
balanced, i.e., if Ss who had to avoid 
light (dark positive) were aided exactly 


7R. H. Davis. 
April 7, 1955. 


even 


simultaneous 


when compared to the successive. 


Personal communication, 


as much as Ss who had to approach light 
(light positive) were handicapped, then 
the phototropic effect would cancel 
itself out. However, as Davis suggests, 
the assumption of a balanced effect 
seems hardly tenable in the light of 
Spence’s finding (5, p. 91) that when 
confounding factors such as phototropic 
effect, multiple discrimination, etc. were 
removed, the simultaneous discrimi- 
nation was easier than the successive. 
In view of Spence’s results and his further 
comment that “The finding of Weise and 
Bitterman was certainly a surprise to 
the present writer, for in the course of 
the last 15 years a number of studies 
had been run in the Iowa laboratories 
that had tended to suggest that the 
successive problem was relatively the 
more difficult of the two situations,” it 
would seem highly probable that the 
simultaneous light-positive Ss are handi 
capped more than the simultaneous 
dark-positive Ss are aided, which would 
account for the lack of superiority of the 
simultaneous group as a whole when 
they are compared with the successive. 

Since the only demonstrations of the 
relative ease of successive discrimination 
when compared to simultaneous have 
involved extremely complex apparatus 
(the four-unit multiple-discrimination 
apparatus and the multiwindow jumping 
stand) with their concomitant confound 
ing of the analysis, it would seem that 
such demonstrations 


cannot be con 


sidered as crucial proof of the inadequacy 
of the approach-avoidance formulation. 


SUMMARY 


Twenty albino rats were run in a single-unit 
hollow-square maze where lights were used as 
cues as in the Weise-Bitterman Calvin- 
Seibel experiments. Half were presented with 
a simultaneous discrimination and half with a 
successive. There was no significant difference 
in terms of errors between the two groups. A 
further breakdown of the simultaneous group 
into those who had to approach light and those 
who had to avoid light revealed that those who 
had to approach light were significantly inferior 
to the successive animals and those who had to 
avoid light were significantly superior to the 
successive. Implications of these findings were 
discussed. 


and 





50 ALLEN D. CALVIN AND CLARENCE M. WILLIAMS 


REFERENCES 


1. Brrreaman, M. E., & Woninsxy, J. Simul- 
taneous and successive discrimination. 
Prychol. Rev., 1953, 60, 371-376. 

2. Carvin, A. D. Configurational learning in 
children. J. educ. Psychol., 1955, 46, 
117-120. 

3. Carvin, A. D., & Semper, J. L. A further 
investigation of response selection in 
simultaneous and successive discrimi- 
nation. J. exp. Psychol., 1954, 48, 339- 
342. 


5. Spence, K. W. 


. Mann, H. B., & Waitney, D. R. On a test 
of whether one of two random variables 
is stochastically larger than the other. 
Ann. math. Statist., 1947, 18, 50-00. 

The nature of response in 
discrimination learning. Psychol., Rev., 
1952, 59, 89-93. 

. Werse,P., & Birrerman, M. E. Response 
selection in discriminative learning. Psy- 


chol. Rev., 1951, 58, 185-195. 


(Received July 12, 1955) 





Journal of Experimental Psychology 
Vol. 52, No. 1, 1956 


STIMULUS UNITS AND RANGE OF EXPERIENCED STIMULI 
AS DETERMINANTS OF GENERALIZATION-— 
DISCRIMINATION GRADIENTS! 


JACOB L. GEWIRTZ, LYLE V. JONES, AND KARL-~ERIK WAERNERYD 


The Unwersity of Chicago 


The slope and symmetry of the 
typical stimulus generalization gradi- 
ent play important roles in a number 
of behavior theories (4, 5, 9, 12). 
Characteristics of the gradient gen- 
erally are introduced as postulates, 
without direct experimental verifi- 
cation. 

The form of a stimulus generali- 
zation gradient is a function of many 
conditions, some of which have been 
recognized in the literature (cf. 2, p. 
319; 11), but few of which have been 
subjected to formal investigation. 
The purpose of the present study is to 
investigate, in human adults, the 


effect of (a) psychological vs. physical 


units of measurement of the stimulus 
scale, and of (b) central tendency, 
determined by earlier experience with 
the stimulus dimension, upon slope 
and symmetry of empirical gradients 
of response strength. 

By stimulus generalization is meant 
the process whereby stimuli along a 
dimension of similarity with a training 
stimulus, and not associated in prior 
training with its learned response, 
tend to elicit the response. In con- 
trast, by stimulus discrimination is 
meant the process whereby, as a 
function of differential reinforcement, 
stimuli other than the training stimu- 
lus tend not to elicit the learned 
response. There can be postulated a 
dimension of experimental conditions, 
ranging from those which yield pure 
stimulus generalization to conditions 

' This study was supported by a grant from 


the Social Science Research Committee of the 
University of Chicago. 


which yield pure stimulus discrimi- 
nation (1). Gradients of response 
which fall more steeply from a maxi- 
mum at the training stimulus are 
expected as the training conditions 
approach those of discrimination. 
The effects investigated in the present 
study are expected to appear in 
response curves at all points along the 
postulated generalization-discrimina- 
tion dimension, excepting the ex- 
tremes, where curves of minimal or 
maximal slope are expected. Experi- 
mental conditions of the present study 
represent an intermediate position 
along this postulated dimension. 

From the time of Fechner’s work 
in psychophysics, a distinction has 
been recognized between psychological 
and physical stimulus dimensions. In 
its most primitive form, the relation 
between the two might be charac- 
terized by postulating that the psy- 
chological scale (R) is a negatively 
accelerated function of the physical 
scale (S). This postulate is consistent 
with typical findings concerning psy- 
chological and physical units, and 
avoids the dilemma posed by evidence 
that Fechner’s Law (R = k& log S) 
does not hold for all values of S. 

An effective test of the use of 
psychological vs. physical stimulus 
units upon both slope and symmetry 
of response curves requires comparison 
of differential responses to stimuli 
near zero in magnitude with dif- 
ferential responses to stimuli far 
from zero in magnitude. The stimu- 
lus selected, that of visual angle, 
exhibits the essential property of zero 
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magnitude, and has the added ad- 
vantage of allowing for stimuli both 
to the right and left of zero angle, 
equivalent in magnitude, differing 
only in sign. 

By Central Tendency is meant the 
tendency for S’s judgment of a stimu- 
lus magnitude to err in the direction 
of a central magnitude of previously 
experienced stimuli. While in psy- 
chophysical studies central tendency 
phenomena frequently have been con- 
sidered explicitly (cf. 2, pp. 323-335; 
3, 6), it is the rare analysis of gen- 
eralization or discrimination data 
which has taken them into account 
(e.g., 10). 


Meruop 


The voluntary response of “same” or “dif- 
ferent” and a stimulus dimension of visual angle 
are employed. Fach of four experimental 
groups is presented with two tasks: (a) training 
to respond in the center stimulus range to the 
zero angle stimulus, followed by testing for 
response to the training stimulus and to 10 
stimuli symmetrically positioned around zero at 
I-mm. intervals,? and (b) training to respond to a 
stimulus markedly different from zero angle, 
either a positive angle in the right range (+ 50 
mm.) or a negative angle in the left range (— 50 
mm.,), followed by testing for response to that 
training stimulus and to 10 stimuli symmetrically 
positioned at 3-mm. intervals around it. Stimu- 
lus units for center and off-center ranges were 
selected on the basis of a preliminary study, 
results of which confirmed the expectation under 
the postulated relation between psychological 
and physical stimulus dimensions: to insure 
obtaining sufficient evidence concerning forms 
of the response gradients, it is necessary to 
employ a wider segment of the stimulus di- 
mension in the off-center range than in the 
center range. 

The four experimental groups differ both 
according to the order of presentation of the 
tasks, and the position of the off-center training 
stimulus (positive or negative angle, see Table 

*Stimulus intervals are given in terms of 
distance, in millimeters, of the pointer needle 
from its central, zero angle position. Through- 
out the stimulus range, 1 mm, transcends a 
stimulus angle of 0° 1’ 36”. ‘The extremes of 
stimuli employed for any S transcend a visual 
angle falling within the range of foveal vision. 


1). The postulated relation between psycho- 
logical and physical units is expected to yield 
(a) steeper slope of response gradients at the 
center than at the off-center stimulus ranges, 
and (b) skewness of curves within off-center 
ranges in the direction away from zero visual 
angle. Central tendency is expected to affect 
response gradients from the second task, 
yielding curves displaced in the direction of the 
stimulus range in the initial task. 

Apparatus.—The stimulus object was a }-in. 
high movable pointer needle, illuminated by a 
prefocused 2.5-v. bulb. The needle was seen 
by S through a horizontal slot 4 in. by 35 in., 
which was vertically centered at eye level on a 
plywood panel. ‘The pointer needle was visible 
to § through the eyepiece of a right-eye viewing 
tube, 82 in. from the stimulus panel. A shutter 
on the viewing tube was operated by a solenoid. 

Mounted between and directly in line with 
the viewing tube and the base of the pointer 
needle at its most central position were two 
sights, for definition of the zero angle stimulus. 
Each was a luminous sphere, § in. in diameter, 
illuminated by a 2.5-v. prefocused bulb. At 
S’s position were two response keys, one for each 
hand, resting at an inclining angle upon micro- 
switches, 

Procedure.—With hands on the two response 
keys, S, seated in a position which aligned the 
fixed sights with the zero angle stimulus, was 
informed that the “strip of light” might appear 
in any of a number of positions. With shutter 
open, the stimulus was presented in eight 
positions: — 80, — 6, — 40, — 20, + 20, 
+ 40, + 60, and +80 mm. The S was in- 
formed that his “speed of judgment” was to be 
tested, and that his task was to release either 
the right or left response key as soon as the 
shutter was opened, depending upon whether 
the stimulus light was in a position “same as” 
or “different from” the “standard” (training 
stimulus). ‘The response task was reversed for 
half the Ss in each group. (For every analysis 
in this paper, the difference between right- and 
left-hand response keys as indicators of “same” 
responses has been tested. Because in no case 
is statistical significance approached by an 
effect involving response hand, that variable is 
subsequently ignored.) ‘The S’s response served 
to close the shutter, which then remained closed 
between trials. Speed of response was em- 
phasized. 

After five 5-sec. presentations of the standard 
training stimulus, during which S was not to 
respond, there were 12 training trials, 10 
demanding response to the training stimulus 
and 2 (No. 6 and 8) demanding response to 
stimuli 30 mm. to its right (Training | in Table 
1). Without interruption, 55 test trials (Test 1) 
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TABLE 1 


Deraits or Experimentat Desicn 


Step 


‘4 


2 
Training Test 1 Test 2 


Training 2 


10 of 12 
trials to 
zero angle 


55 trials 
to center 
range 


10 of 12 
trials to 
+ 50-mm. 
(to right) 


55 trials 
to right 


angle range 


10 of 12 
trials to 
zero angle 


10 of 12 
trials to 
— 50-mm. 
(to left) 


55 trials 
to center 
range 


55 trials 
to left 


angle range 


10 of 12 
trials to 
+ 50-mm. 
(to right) 


10 of 12 
trials to 
zero angle 


55 trials 
to right 
range 


55 trials 
to center 


angle range 


10 of 12 
trials to 
— 50-mm. angle 
(to left) 


10 of 12 
trials to 
zero angle 


55 trials 
to left 
range 


55 trials 
to center 
range 

















followed, Order of stimuli was randomized with . 
the restriction that every one of the 11 stimulus 
positions within the appropriate stimulus range 
be represented once in each of the five cycles of 


trials. 


would now see the “standard” (training stimu- 
lus) in a position different from those used before. 
The Training 2 and Test 2 procedures which 
then followed were the same as those employed 
for the Training 1 and Test 1 procedures, except 


A 5-min. rest period followed Test 1, after 
which the final procedures instituted 
(Training 2 and Test 2). After S was seated 
and had aligned the sights with the zero angle 
stimulus, he was informed that the “speed of 
judgment” test was to continue, and that he 


were 


5 


that a different stimulus range was involved, 
The sequences of the training and test conditions 
for the four experimental groups are detailed in 
Table 1. 
Subjects.—The Ss were 21 men and 19 women, 
summer-session students at the University of 
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Mean response curves over the three ranges of stimuli, Test 1. 
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Chicago, who were randomly assigned to the 
four experimental groups of 10 Ss each. 


Resutts anp Discussion 


Psychological vs. physical stimulus 
dimensions.—In Fig. 1 appear three 
mean response curves for the initial 


test (Step 2, Table 1) of the LC, RC, 


and combined CL and CR groups, 
based upon WN’s of 10, 10, and 20. 
The mean number of “same” response 
is graphed for each stimulus. For 
illustration of expected effects upon 
slope and symmetry of response 
curves, responses are plotted against 
stimulus magnitude measured on the 
physical dimension. 

The most dramatic difference 
among the curves is one of slope, with 
both halves of the curve for the center 
stimulus range markedly steeper than 
those for the off-center ranges. An 
approximate index of slope for each 
S is taken as the sum of “same” 
responses to the two most extreme 
stimulus values within his stimulus 
range. Considering the median of 
all 40 scores, it is found that 17 of 20 
Ss in the off-center range (LC and 
RC) and 3 of 20 Ss in the center range 
(CL and CR) exhibit scores above 


Mean response curves over the three ranges of stimuli, Test 2. 


that median. An exact test of the 
significance (7) of differences between 
the indices for center vs. off-center 
ranges (NV, = Nz = 20) yields a two- 
tailed P < .OO1, and indicates the 
stability of the differences in slope 
between sets of curves for individual 
Ss. That this result occurs even with 
off-center stimulus units three times 
as large as center stimulus units 
attests to the importance of stimulus 
range as a determinant of the slope 
of response gradients. 

The results of Fig. 1 also appear 
consistent with the prediction re- 
garding skewness of response curves. 
The right half of the LC curve appears 
to drop with steeper slope than the 
left half, and the reverse is true of the 
RC curve. For individual Ss, 7 of 
the 10 in the LC group and 6 of the 
10 in the RC group exhibit the pre- 
dicted direction of skewness. 

Central Tendency.—The independent 
variables considered with respect to 
Central Tendency are: (a) Order 
(whether a given stimulus range was 
presented as the initial or final con- 
dition); (6) Position (11 stimuli in 
each stimulus range employed); and 
(c) Cycle (one of five successively 
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presented blocks of I1 trials). In 
Fig. 2 are presented the mean curves 
generated in Test 2 by each of the 
four experimental groups. A com- 
parison of these four curves with the 
three curves generated in Test l, as 
presented in Fig. 1, appears to indicate 
that the mean response gradient for 
each of the groups is a function of its 
prior experimental experience with 
stimuli. 

To simplify the analysis of dif- 
ferences between mean response gradi- 
ents, there is tested the assumption 
that the LC and RC groups and the 
CL and CR groups should differ only 
as mirror images. After signs of the 
nominal stimulus positions are re- 
flected (reverséd) in both stimulus 
ranges for members of the RC group 
(henceforth labeled RC), and in the 
off-center stimulus range for members 
of the CR group (henceforth labeled 
CR), there are tested the significance 
of mean differences between LC and 
RC (in both ranges) and between CL 
and CR (off-center), for all dependent 
variables to be treated in subsequent 
analyses. For this test, use is made 
of the analysis of variance design for 
repeated measurements on the same 
Ss, labeled Type I by Lindquist (8, 
pp. 267-273). Because in no case did 
differences approach statistical sig- 
nificance, for subsequent analyses the 
four experimental groups are com- 
bined into two, reflecting order of 
conditions: the LC and RC group 
and the CL and CR group. 

Because Position is defined in 
terms of different units in the center 
and off-center stimulus ranges, two 
Type I analyses of variance, one for 
each stimulus range, are carried out 
on the frequency of “same” response 
over five cycles at each stimulus 
position, to determine the effect of 
Central Tendency. The possible 


range of individual scores is 0 to §, 
All 11 stimulus positions are included 
in the analysis for the off-center range. 
Because of low-frequency scores for 
the three positions at both extremes 
in the center range (— 5, — 4, — 3; 
and 3, 4, 5 mm.), assumptions under- 
lying the analysis of variance are not 
met, and those data are excluded 
from the center range analysis, though 
the positions are represented in the 
figures. 

Table 2 indicates, both for the 
central five stimulus positions of the 
center range, and for the 11 stimulus 
positions of the off-center range, that 
the Position by Order interaction 
effects which would reflect Central 
Tendency are significant. (The com- 
bined mean curves which were in- 
volved in the tests are not presented 
formally, but may be developed 
readily from the component curves 
appearing in the figures.) The off- 
center mean response curve for Ss 
having prior experience with the 
center range of stimuli (CL and CR, 
Fig. 2) is displaced systematically 
toward the center range, as contrasted 
to the off-center mean response curve 


TABLE 2 


Summaries or ANALYSES OF VARIANCE 
ror Position Errects 


| c ( 
Center Kange ~ — 
Source 


Mean 


| Mean 
| Square 


Square 


Order 
Error (b) 
Total between-Ss 


2.4 
2.86 


Position 

Position K Order 
Error (w) | 
Total within-Ss 


10 | 24.15** 
10 | 11.65** 


Total 


* Vields F significant beyond 05 level 
** Vields F significant beyond 001 level 
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of Ss having no prior experience in the 
experiment (LC and RC, Fig. 1). 
For the five stimulus positions in the 
center range, the mean response curve 


of the LC and RC group is displaced 
systematically in the direction of the 
off-center stimulus range involved in 
their prior experience, when con- 
trasted with the mean response curve 
for the CL and CR group which had 
no prior experimental experience. 
The significant Position effects found 
in both ranges indicate that stable 
response gradients were obtained by 
individuals. 


In the preceding analysis, the possible im- 
portance of Cycle has been ignored because the 
variable, frequency of “same’’ response, could 
take only the values zero or one for each test 
stimulus in each cycle, precluding the inclusion 
of Cycle and Position as independent variables 
in the same analysis. Cycle could effect Central 
Tendency as a result of either of two compen- 
satory processes; its reduction due to increasing 
experience with the second stimulys range, or 
its enhancement due to negative time-order 
error (2, pp. 305-311). To study the effect of 
Cycle there is established a dependent variable 
which can take any one of a number of possible 
values: the arithmetic mean stimulus position 
value for each cycle, calculated only from those 
stimulus positions which called forth “same” 
responses. In two distinct Type I analyses of 
variance, one for each stimulus range, neither 
Cycle nor Order by Cycle effects approach 
significance. ‘To the extent that the crude index 
is adequate, it is unlikely that systematic dif- 
ferences in performance over cycles account for 
the obtained results. 


SUMMARY 


The aim of the present experiment was to 
investigate the effects of psychological vs. 
physical stimulus units and central tendency, 
determined by earlier experience with the 
stimulus dimension, upon generalization-dis- 
crimination gradients, A stimulus dimension 
of visual angle and a voluntary motor response 
were employed. 

Four experimental groups, each of 10 adult 
human Ss, were employed, Each S was pre- 
sented with two tasks: (a) training to respond 
to a zero angle stimulus, followed by testing for 


response to the training stimulus and to 10 
stimuli symmetrically positioned around zero at 
intervals of 1’ 36” (in the center stimulus range); 
and (b) training to respond to a nonzero angle 
stimulus, either a positive angle (in the right 
stimulus range) or a negative angle (in the left 
stimulus range) markedly different from zero, 
followed by testing for response to the training 
stimulus and to 10 stimuli symmetrically posi- 
tioned around it at intervals of 4’ 48”. The 
four groups differed with respect to: the order 
of the tasks presented (whether a given stimulus 
range was presented first or second); and the 
position of the off-center training stimulus 
(either positive or negative). 

Major results are as follows: (a) As expected 
from the assumed relationship between psycho- 
logical and physical stimulus units, individual 
response gradients for Ss initially presented 
stimuli in the center range exhibited steeper 
slopes (P < .001) than those for Ss initially 
presented an off-center stimulus range. (b) As 
expected from the Central Tendency concept, 
response gradients of individual Ss, both in the 
center and off-center stimulus ranges, were 
displaced significantly in the direction of their 
previous experience with the stimulus dimension 
(P < 05, P < 001). 
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The present study deals with the 
relationship between learning and 
motivation. Cast in terms of Hull’s 
system an attempt was made to 
determine whether s/n = f(D). In 
a recent paper Spence (7) outlined the 
conditions necessary for a successful 
test of this relationship and indicated 
the shortcomings of certain previous 
studies. No extension of his analysis 
will be made here although the lit- 
erature indicates a great deal of 
confusion and contradictory data. 

The complete factorial design, a 
necessary condition for this test, as 
outlined by Spence, was utilized. 
Thus three groups of rats were trained 
under one of three levels of hunger 
deprivation, respectively. At the 
completion of training each main 
group was divided into three sub- 
groups, which were then tested under 
one of the drive conditions of the 
three main groups. Such a design 
yields a 3 & 3 table, the rows repre- 
senting drive during initial learning, 
the columns drive during the test. If 
the scores which are entered are test 
scores, and drive differences along the 
rows are equated through the factori- 
zation of the drive variable, any 
difference in the row means must 
represent some residual effect of the 
initial training. This difference, if 
present, is therefore a reflection of 
learning (s//p). 


Mertruop 


Apparatus.—The discrimination apparatus 
used in this experiment was adapted from Grice 
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(3). It was of }-in. plywood, covered with 
}-in. hardware cloth. The design and dimen- 
sions were as shown in Fig. 1. ‘The starting 
boxes, choice area, and adaptation goal boxes 
were painted a flat, intermediate gray. The two 
outer goal boxes of the training unit were 
painted in white enamel, and the middle box in 
flat black. All of the guillotine doors were 
gray with the exception of those surfaces facing 
the inside of the black and the white boxes, 
respectively. These surfaces followed the color 


of the box. 


ADAPTATION 
BOXES 





+ 


black} 12 


4 


2 
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LEARNING 
END-BOXES 








k |white 
‘ 


CHOICE 

AREA 
STARTING 

BOX 


4 


+ 
9 





| 
Fa 


Fic. 1. Discrimination apparatus. The 
black-white-black boxes were replaced by the 
gray boxes during adaptation. 





In addition to the general room illumination, 
a @&-w. lamp was centered 48 in. above the goal 
boxes. 

Subjects—The Ss were hooded, male rats 
from the colony maintained at the University 
of California, Los Angeles, and were between 90 
and 120 days at the start of the experiment. 
Sixty Ss were started but 7, 6, and 4 were lost 
from the 4-, 22-, and 46-hr. drive groups, re- 
spectively. The results, threfore, are based on 
data obtained from 43 Ss. 

Maintenance.—Three levels of food depri- 
vation (4-hr., 22-hr., and 454-hr.) were utilized. 
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On the day Ss were taken from the stock house, 
they were divided into three equal groups, at 
random, and assigned to one of these drive 
levels. For the first 24 hr. all Ss were fed 
Friskies pellets (which were used as the reward 
in the subsequent phases of the procedure) ad 
libitum. On Day 2 the maintenance schedule 
was begun. The 22-hr. and 46-hr. groups were 
fed for 1 hr., 1 hr. after the time that their 
experimental sessions were subsequently begun. 
One-half hour was allowed for running time and 
a .5-hr. interval between the end of running and 
the beginning of feeding was imposed. A 
period of 22 hr. elapsed between the end of 
feeding and the beginning of the experimental 
session for the 22-hr. group. The 46-hr. group, 
of course, was fed only every other day and 
therefore 46 hr. elapsed between these two 
points. The 4-hr. group was fed for 1 hr. 
beginning 5 hr. before the scheduled running 
time. Four hours elapsed between the end of 
feeding and the start of running. To equalize 
intervals between experimental sessions (since 
the 46-hr. group could only be run once during 
a 48-hr. period) both the 4hr. and the 22-hr. 
groups were also run only every other day. 

On Day 22 of the procedure, four days before 
the learning phase began, the schedule was 
interrupted for 24 hr. of ad libitum feeding to 
allow some relief from the severe deprivation 
imposed on the 46-hr. group. ‘To equalize con- 
ditions, however, the ad libitum feeding was 
extended to the other two groups as well. 
Figure 2 provides a graphic summary of the 
maintenance schedule. 

Adaptation.—The adaptation phase covered 
a period of 25 days. On Day 1 the Ss were 
taken from stock, divided into three random 

















L 
24 
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Fic. 2. Maintenance schedule. The 1-hr: 
feeding period is indicated by F and the 1I-hr. 
experimental period by R. 


groups as described above, and placed in their 
home cages, five Ss to a cage. Handling, 
marking, and institution of the maintenance 
schedule were accomplished on Days 2 through 
8. Handling of a particular S took place only 
during that period which was set aside for the 
subsequent experimental treatment of that S, 
e.g., the 4-hr. group was handled 4 hr. after 
feeding. 

One-half hour of group exploration of the 
discrimination apparatus was given on Days 9, 
Il,and 13. The gray end boxes were used. On 
Days 9 and 11 there was no food in the ap- 
paratus. On Day 13 there was a quantity of 
food in the cup in each box. Free trials were 
started on Day 15 and continued throughout the 
adaptation period. On Day 15, 2 trials were 
given, then 3, 6, 6, and 6 on Days 17, 19, 21, 
and 24, respectively. ‘The S was placed in the 
starting box, the guillotine door raised, regardless 
of orientation, and the door to the end box 
lowered as soon as S was entirely in the box, 
i.¢., a response was considered completed only 
when all four feet were within the box. ‘Time 
was taken from the lifting of the starting door 
to the lowering of the end-box door. Food was 
in both cups during these trials. During the 
first seven trials the large quantity of food in the 
cup was gradually reduced to one pellet, which 
remained constant from then on. 

Original learning.—-The black-white  dis- 
crimination learning was begun on Day 26 
The procedure was the same as during adap- 
tation except that the black-white boxes were 
used and a pellet was placed in only the white 
cup. Six trials per day were given for all 
groups. A decided black preference was ob 
served in pilot work and therefore all Ss were 
trained white-positive during this phase. The 
criterion of learning was 14 correct out of 16 
trials. The right-left placement of the reward 
and therefore the placement of the white box 
was randomly determined. ‘Trials were rotated 
with the intertrial interval approximately 5 
min. The Ss were detained in the end box for 
20 sec. on an incorrect run. 
procedure was adopted. The Ss were kept in 
individual retaining cages between trials. ‘These 
were different from both the home cage and the 
end box in constructioti, color (brightness), etc. 

Drive shift.—After the criterion was reached 
on the original problem, drives were shifted and 
Ss were trained to the reversal, ic, black 
positive. Between the white-positive and black 
positive training Ss were fed ad libitum for 48 
hr. and then adapted to the new drive for 48 
hr.—one-third of Ss retained on old drive, one- 


A noncorrection 


‘third shifted, randomly, to each of the other 


drives and reversal training to black-positive 
was to the same criterion as in original learning 
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TABLE 1 


Perrormance Measures IN 
OnicinaL Learninc 








Reinforce- 
ments 


25.6 
27.1 
20.5 


Drive N Errors Trials 


45.5. 
444 
32.3 


4-hr. 
22-hr. 


46-hr. 119 








Tests of Significance 
Errors: 
toes = 1.18; P >: 
toro = 2.40;P <. 


83;P>. 


F = 6.69;P < Ol 
Reinforcements : 
tou 
F=7.79;P < Ol 
Trials: 


F = 8.26; P < 01 


J31;P>. 
teow ™ 3.33; P<! 


ten = 








Four measures were obtained during reversal 
training: errors, reinforcements, trials to the 
criterion, and resistance to extinction (n). This 
measure was defined as the number of trials on 
which S returned to the formerly correct side 
(white) before the first run to the other side 
(black). 


RESULTS 


Data for the initial learning are 
presented in Table 1. A summary of 
the various tests of significance is also 
included. With respect to each of 
the three measures, a simple analysis 
of variance indicated a significant 
difference among groups. The proba- 
bility values are all beyond the .01 
level. The ¢ tests shown were based 
on the use of the common error esti- 
mate of the variance analysis. It 
may be seen that the 4 and 22-hr. 
groups did not differ significantly with 
respect to any one of the three meas- 
ures. The 46-hr. group did differ, 
however, from each of the other 
groups at the .03 level or better. 
Thus, the 46-hr. group required fewer 
errors, trials, and reinforcements to 
reach the criterion than either of the 
other two groups. 


Data on the reversal learning are 
presented in Table 2. This table is 
composed of five subtables, each 
arranged in the form of a 3 X 3 table 
with the columns representing drive 
during reversal and the rows drive 
during original learning. All cell as 
well as border entries are mean values 


TABLE 2 


Perrormance Measures IN 
Reversat Learninc 








Reversal Drive 
Original 
Drive 





46-hr. 


4-hr. | 22-hr. 





Number of Cases 





4-hr. 
22-hr. 
46-hr. 





Total 











| 








21.0 
25.0 
20.2 


22.0 


Trials 





35.2 
48.4 
41.2 


42.0 





Extinction ( 
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except for the subtable indicating the 
number of cases. In the latter, the 
borders indicate the total number of 
Ss in each of the three main groups 
during reversal and original learning, 
respectively, and the cells the number 
of Ss in each of the nine subgroups 
composing these main groups. 

Four separate analyses of variance! 
were performed for each of the re- 
sponse measures. None of the F’s 
associated with the column main 
effects were significant, nor were any 
of the column-by-row interactions. 
Thus, drive during reversal had no 
differential effect on learning. The 
row main-effects analyses did, on the 
other hand, reveal a significant dif- 
ference for one of the response meas- 
ures. As in the case of the column 
data, neither errors (F = 2.01, P 
> .05), reinforcements (F = 1.23, P 
> .05), nor trials (F = .70, P > .05) 
varied during reversal as a function 
of the drive during original training. 


Resistance to extinction, however, did 
reflect differences as a function of 
original drive (F = 16.5, df = 2 and 


34, P < Ol). Resistance to extinc- 
tion between groups trained under 4- 
and 22-hr. of drive was not significant 
(t = 1.49); whereas, the differences 
between the 46-hr. group and each of 
the other two were significant at 
beyond the .O1 level (too « = 3.98). 
This was similar to the results of 
original learning. In both cases the 
4- and 22-hr. groups did not differ on 
any one of the measures whereas the 
46-hr. group did differ from both on 
each of the obtained indicators. 


Discussion 


The first point that bears discussion 
is the lack of difference between the 4- 


! The 3 X 3 analysis involved disproportional 
subgroup frequencies. Correction for dispro- 
portionality was accomplished by the method of 
Walker and Lev (10). 


and 22-hr. groups during both original 
and reversal learning. A _ possible ex- 
planation involves the maintenance pro- 
cedure. Both groups were fed 1 hr. 
during a 24-hr. period. The 4-hr. group 
was run 4 hr. after feeding, the 22-hr. 
group was run 22 hr. after feeding. Both 
were deprived for a period of 23 hr. out 
of 24. If drive were a function of two 
parameters, the total deprivation time 
and hours of deprivation at the time of 
running (4) and éA reached an asymptote 
at or before 4 hr. then both the 4-hr. and 
the 22-hr. groups would have had the 
same degree of drive and no differences 
would have been expected. That some 
sort of two-parameter definition of drive 
is necessary is indicated by work now in 
progress (2). 

The interpretation of the effects of 
drive differences during reversal on 
reversal learning, as presented in Table 
2, is relatively simple. Lack of sig- 
nificance on any one of the response 
measures indicates that drive during 
reversal had no differential effect on 
learning. Why it should be ineffective 
during reversal learning when it was 
effective during original learning is a 
more difficult question. It may have 
resulted from the fact that at the be- 
ginning of reversal training, all Ss had 
fairly strong incorrect habits as a result 
of the original training. The multi- 
plication of these incorrect habits by 
high D could cause more interference 
than when multiplied by low D. The 
46-hr. group on this basis would therefore 
be expected to do more poorly. It was 
seen, however, that high D had a facili- 
tating effect on performance during 
original learning, and from this point of 
view the 46-hr. groups should have done 
better than the lower drive groups on 
the reversal learning. These two factors, 
one facilitating, the other retarding, may 
have canceled each other and thus 
produced apparent equality of learning 
with respect to drive differences during 
reversal. 

The interpretation of the effects of 
drive differences during original learning 
on reversal learning presents a more 
complicated picture. Neither errors, nor 
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reinforcements, nor trials during reversal 
yielded significant differences as a func- 
tion of drive during original learning, 
indicating the independence of habit 
strength and drive. Differences between 
means on resistance to extinction, how- 
ever, were significant at beyond the .01 
level, and this was primarily due to the 
greater resistance to extinction of Ss 
trained under the 46-hr. drive. Since 
the groups were equated with respect to 
drive during reversal, only two factors 
could account for this difference. If Ss 
trained originally under 46-hr. depri- 
vation had acquired more sHp for white 
than either of the other groups, this 
would be expected. According to Hull’s 
most recent formulation (5), sHr is 
assumed to be a function of the number 
of reinforcements. Referring to Table 
1, it is seen that the 46-hr. group actually 
received significantly fewer reinforce- 
ments than the other groups. This 
should therefore have led to the develop- 
ment of less gHpr and consequently a 
weaker nm. That this was not the case 
is here explained by the possibility that 
sHpr and D are not independent. If the 
increments in s/, were greater under the 
higher levels of D than under lower, then 
the 46-hr. group could have acquired 
more gsHpr for white during original 
learning, making their reversal learning 
more difficult, i.e., producing greater n 
for the 46-hr. group. The second factor 
that might have accounted for the 
“slower” learning of Ss trained under 
46-hr. deprivation, originally, is con- 
ditioned inhibition (s/r). If the 46-hr. 
Ss developed more g/px for the black 
during original training, then they would 
have tended to avoid the shift to black 
after reversal. Referring to Table 1, it 
may be seen that the 46-hr. group had 
fewer errors, and also fewer total trials 
to criterion than either of the other two 
groups, implying that, if anything, their 
slr for black was less than for either the 
4- or 22-hr. groups. This factor, then, 
should have worked against the direction 
of the obtained results. It was also the 
case that trials were spaced during the 
original as well as reversal phases, 
minimizing the accumulation of gsJp. 


It is concluded therefore that sHer is 
functionally related to the drive under 
which it is developed. 

The question of why errors, reinforce- 
ments, and trials did not differ signifi- 
cantly, whereas m did differ can be 
answered only speculatively, on the basis 
of the results of this study. It may 
simply be that these measures are not 
sensitive enough to reflect small differ- 
ences. Another more attractive possi- 
bility is that once the initial extinction 
occurs, there are no longer any differences 
to be measured. The groups, it should 
be remembered, were equated for all 
variables except drive during original 
training. If the effects of drive during 
original learning are equalized during 
the initial period of reversal learning, 
i.e., during the 3.6, 4.7, and 7.6 successive 
errors at the beginning of reversal, then 
no differences in learning after that 
period should be expected. This, of 
course, was the case. The extreme 
similarity in relearning can more easily be 
observed if the extinction errors are 
subtracted from the total errors or from 
the trials to relearn. The scores for 
errors after this subtraction for the 4-, 
22- and 46-hr. groups, respectively, are 
12.0, 13.7, and 13.1. For trials they are 
33.4, 36.5, and 33.3. Considering the 
small N and the usual variability of rat 


data, the differences between these 
figures are indeed small. 
This interpretation is further sup- 


ported by the results of a recent study 
(1). Using a straight alley, high- and 
low-drive groups of rats were given 14 
training trials to food. After training, 
each group was divided into two sub- 
groups and extinguished under one of 
the two drive levels of the training 
period, respectively. Differences in 
trials to extinction, as a function of 
drive during training, were not signifi- 
cant. Differences in running speed at- 
tributable to drive during training, for 
the first six extinction trials, however, 
were significant. The latter data would 
indicate, therefore, the dependence of 
habit strength on the drive during 
learning, whereas the former would 
indicate the independence of the two 

















constructs. Two interpretations were 
offered as a means of explaining this 
ambiguity. One questioned the validity 
of Hull’s use of both amplitude (running 
speed) and resistance to extinction as 


alternative measures of response 
strength. The second suggested that 
the high-drive Ss learned responses 


which were different from those learned 
by Ss trained under low drive. It is 
felt that the interpretation given in the 
present study is more parsimonious than 
the two offered by Campbell and 
Kraeling. Referring again to that study, 
it was reported that the running-speed 
differences during extinction were most 
disparate during the first few extinction 
trials, and it was shown that by Trial 10 
there were practically no differences 
between Ss trained under high and low 
drive, respectively. In other words, 
differences between groups were only 
apparent during the initial stages of 
extinction. If a more severe criterion 
of extinction were employed and/or a 
greater number of trials than the first 
six extinction trials were used, it is 
probable that even the amplitude meas- 
ure (running speed) would have yielded 
insignificance. The point is that the 
amplitude measure yielded significance 
not because it measured a different 
aspect of response strength from trials 
to extinction or that different responses 
were learned under high as compared 
with low drive, but that the amplitude 
measure reflected differences due to 
training drive level which were only 
present during the early stages of the 
extinction period. In certain other 
studies of similar design (4, 6, 8, 9), 
trials to extinction were also used and 
yielded negative results. That is to Say, 
no differences were obtained, and it was 
concluded that habit was independent 
of drive. The implication of the Camp- 
bell and Kraeling interpretation would 
be that the lack of agreement between 
their study and the others is attributable 
to the use of an amplitude measure in 
theirs and a resistance-to-extinction 
measure in the others. It is suggested 
here, however, that the lack of agree- 
ment is due to the use of a measure which 
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in the one case reflected differences 
during the early phases of extinction 
and in the other cases a measure which 
was insensitive to these intial differences. 
It is not that amplitude and resistance 
to extinction measure reaction potential 
differently or that they measure different 
aspects of it, but that they measure or 
reflect reaction potential at different 
points in the learning process. 


SUMMARY 


Three groups of hooded, male rats were 
trained to run to white in a black-white dis- 
crimination apparatus under 4, 22, and 46 hr. of 
food deprivation, respectively. After reaching 
the criterion of learning, on white-positive, 
drives were shifted and the Ss retrained to 
black-positive. The drive shift involved the 
division of each main group into three subgroups, 
each corresponding to one of the drive conditions 
of the three main groups, respectively. 

No significant differences were obtained 
between the 4- and 22-hr. groups either during 
the original, white-positive or during the reversal, 
black-positive learning phases. An hypothesis 
involving a two parameter definition of drive 
was offered to explain this lack of difference 

The 46-hr. group was significantly superior 
to the lower drive groups during original learning 
with respect to errors, reinforcements, and trials 
to the criterion of learning. The 46-hr. group 
showed greater resistance to extinction than 
either of the low-drive groups during the re 
versal phase, although it did not differ with 
respect to errors, trials, or reinforcements to the 
criterion of relearning. This was explained on 
the basis of the initial extinction of differences 
at the beginning of reversal learning and it was 
that habit strength 
functional relationship to drive during learning 


concluded bears some 
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